-
Notifications
You must be signed in to change notification settings - Fork 89
/
index.qmd
217 lines (192 loc) · 8.45 KB
/
index.qmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
# Preface {.unnumbered}
```{r echo=FALSE,eval=FALSE}
knitr::write_bib(c(
"abind",
"classInt",
"caret",
"colorspace",
"dbscan",
"dbscan",
"ecmwfr",
"gdalcubes",
"gdalUtilities",
"ggplot2",
"ggspatial",
"gstat",
"hglm",
"HSAR",
"igraph",
"INLA",
"lme4",
"lwgeom",
"mapsf",
"mapview",
"Matrix",
"mgcv",
"osmar",
"osmdata",
"R2BayesX",
"raster",
"RColorBrewer",
"rgee",
"rgeoda",
"rnaturalearth",
"rstac",
"s2",
"sfnetworks",
"sf",
"sftime",
"sp",
"spacetime",
"spatialreg",
"spatstat",
"spData",
"spdep",
"splm",
"stars",
"stcos",
"stplanr",
"stpp",
"terra",
"tidyverse",
"tmap",
"tsibble",
"units",
"viridis"
), "packages.bib", width = 60)
```
Data science is concerned with finding answers to questions on the
basis of available data, and communicating that effort. Besides
showing the results, this communication involves sharing the data
used, but also exposing the path that led to the answers in a
comprehensive and reproducible way. It also acknowledges the fact
that available data may not be sufficient to answer questions, and
that any answers are conditional on the data collection or sampling
protocols employed.
\index{data science}
\index{spatial data science}
This book introduces and explains the concepts underlying
_spatial_ data: points, lines, polygons, rasters, coverages, geometry
attributes, data cubes, reference systems, as well as higher-level
concepts including how attributes relate to geometries and how
this affects analysis. The relationship of attributes to geometries
is known as support, and changing support also changes the
characteristics of attributes. Some data generation processes are
continuous in space, and may be observed everywhere. Others are
discrete, observed in tesselated containers. In modern spatial data
analysis, tesellated methods are often used for all data, extending
across the legacy partition into point process, geostatistical and
lattice models. It is support (and the understanding of support) that
underlies the importance of spatial representation. The book aims
at data scientists who want to get a grip on using spatial data
in their analysis. To exemplify how to do things, it uses R.
In future editions we hope to extend this with examples using Python
[see, e.g., @bivand:22] and Julia.
It is often thought that spatial data boils down to having
observations' longitude and latitude in a dataset, and treating these
just like any other variable. This carries the risk of missed
opportunities and meaningless analyses. For instance,
* coordinate pairs really are pairs, and lose much of their meaning
when treated independently
* rather than having point locations, observations are often
associated with spatial lines, areas, or grid cells
* spatial distances between observations are often not well
represented by straight-line distances, but by great circle
distances, distances through networks, or by measuring the effort
it takes getting from A to B
We introduce the concepts behind spatial data, coordinate reference
systems, spatial analysis, and introduce a number of packages,
including **sf** [@rjsf; @R-sf], **stars** [@R-stars], **s2** [@R-s2]
and **lwgeom** [@R-lwgeom],
as well as a number of spatial **tidyverse** [@welcome; @R-tidyverse] extensions, and a number of
spatial analysis and visualisation packages that can be used with these packages,
including **gstat** [@gstatcg; @R-gstat], **spdep** [@R-spdep], **spatialreg** [@R-spatialreg], **spatstat** [@baddeley2015spatial; @R-spatstat],
**tmap** [@tmap; @R-tmap] and **mapview** [@R-mapview].
Like data science, spatial data science seems to be a field that
arises bottom-up in and from many existing scientific disciplines
and industrial activities concerned with application of spatial
data, rather than being a sub-discipline of an existing scientific
discipline. Although there are various activities trying to scope it
through focused conferences, symposia, chairs and study programs,
we believe that the versatility of spatial data applications and
questions will render such activity hard. Giving this book the
title "spatial data science" is not another attempt to define the
bounds of this field but rather an attempt to contribute to it from
our 3-4 decades of experience working with researchers from
various fields willing to publicly share research questions, data,
and attempts to solve these questions with software. As a consequence,
the selection of topics found in this book has a certain bias towards
our own areas of research interest and experience. Platforms that
have helped create an open research community include the ai-geostats
and r-sig-geo mailing lists, sourceforge, r-forge, GitHub, and the
OpenGeoHub summer schools organized yearly since 2007. The current
possibility and willingness to cross data science language barriers
opens a new and very exciting perspective. Our motivation to
contribute to this field is a belief that open science leads to
better science, and that better science might contribute to a more
sustainable world.
::: {.content-visible when-format="html"}
## Printed version of this book
![](images/cover.jpg){fig-alt="cover art by Allison Horst" fig-align="left"}
The printed version of this book (1st edition) is available from the
[Routeledge/CRC](https://www.routledge.com/Spatial-Data-Science-With-Applications-in-R/Pebesma-Bivand/p/book/9781138311183)
or [Taylor and Francis](https://www.taylorfrancis.com/books/mono/10.1201/9780429459016/spatial-data-science-edzer-pebesma-roger-bivand)
websites. Cover art was created by [Allison Horst](https://mastodon.social/@allison_horst/110719442735137338).
## Citing this book
The full reference is: Pebesma, E.; Bivand, R. (2023). Spatial
Data Science: With Applications in R (1st ed.). 314 pages. Chapman and
Hall/CRC, Boca Raton. https://doi.org/10.1201/9780429459016
BibTeX:
```
@book{sds,
author = {Edzer Pebesma and Roger Bivand},
year = 2023,
title = {Spatial Data Science: With Applications in {R}},
publisher = {Chapman and Hall/CRC},
address = {Boca Raton},
URL = {https://doi.org/10.1201/9780429459016},
doi = {10.1201/9780429459016}
}
```
:::
## Acknowledgements {-}
We are grateful to the entire r-spatial community, especially those who
* developed r-spatial packages or contributed to their development,
* contributed to discussions on twitter and Mastodon `#rspatial`, or on GitHub,
* brought comments or asked questions in courses, summer
schools, or conferences.
We are in particular grateful to Dewey Dunnington for implementing
the **s2** package, and for active contributions from Sahil Bhandari,
Jonathan Bahlmann for preparing the figures in @sec-datacube,
Claus Wilke, Jakub Nowosad, the "Spatial Data Science with R"
classes of 2021 and 2022, and to those who actively contributed
with GitHub issues, pull requests, or discussions:
* to the book repository (Nowosad, jonathom, JaFro96,
singhkpratham, liuyadong, hurielreichel, PPaccioretti, Robinlovelace,
Syverpet, jonas-hurst, angela-li, ALanguillaume, florisvdh,
ismailsunni, andronaco),
* to the sf repository (aecoleman, agila5, andycraig, angela-li,
ateucher, barryrowlingson, bbest, BenGraeler, bhaskarvk, Bisaloo,
bkmgit, christophertull, chrisyeh96, cmcaine, cpsievert, daissi,
dankelley, DavisVaughan, dbaston, dblodgett-usgs, dcooley,
demorenoc, dpprdan, drkrynstrng, etiennebr, famuvie, fdetsch,
florisvdh, gregleleu, hadley, hughjonesd,
huizezhang-sherry, jeffreyhanson, jeroen, jlacko, joethorley,
joheisig, JoshOBrien, jwolfson, kadyb, karldw, kendonB, khondula,
KHwong12, krlmlr, lambdamoses, lbusett, lcgodoy, lionel-, loicdtx,
marwahaha, MatthieuStigler, mdsumner, MichaelChirico, microly,
mpadge, mtennekes, nikolai-b, noerw, Nowosad, oliverbeagley, Pakillo,
paleolimbot, pat-s, PPaccioretti, prdm0, ranghetti, rCarto, renejuan,
rhijmans, rhurlin, rnuske, Robinlovelace, robitalec, rubak, rundel,
statnmap, thomasp85, tim-salabim, tyluRp, uribo, Valexandre,
wibeasley, wittja01, yutannihilation, Zedseayou),
* to the stars repository (a-benini, ailich, ateucher, btupper,
dblodgett-usgs, djnavarro, ErickChacon, ethanwhite, etiennebr, flahn,
floriandeboissieu, gavg712, gdkrmr, jannes-m, jeroen, JoshOBrien,
kadyb, kendonB, mdsumner, michaeldorman, mtennekes, Nowosad, pat-s,
PPaccioretti, przell, qdread, Rekyt, rhijmans, rubak, rushgeo,
statnmap, uribo, yutannihilation),
* to the s2 repository (kylebutts, spiry34, jeroen, eddelbuettel).
::: {.content-visible when-format="html"}
<a href="https://mastodon.social/@edzer" rel="me"></a>
:::