diff --git a/docs/01_data_preparation.html b/docs/01_data_preparation.html index 44bd14f60..a884c4a08 100644 --- a/docs/01_data_preparation.html +++ b/docs/01_data_preparation.html @@ -520,11 +520,11 @@
DataE
-2022-10-10 at 08:17:50 | INFO | Retrieving COVID-19 dataset from https://github.com/lisphilar/covid19-sir/data/
-2022-10-10 at 08:17:50 | INFO | Retrieving datasets from COVID-19 Data Hub https://covid19datahub.io/
-2022-10-10 at 08:17:58 | INFO | Retrieving datasets from Our World In Data https://github.com/owid/covid-19-data/
-2022-10-10 at 08:18:00 | INFO | Retrieving datasets from Our World In Data https://github.com/owid/covid-19-data/
-2022-10-10 at 08:18:00 | INFO | Retrieving datasets from Our World In Data https://github.com/owid/covid-19-data/
+2022-10-10 at 08:54:45 | INFO | Retrieving COVID-19 dataset from https://github.com/lisphilar/covid19-sir/data/
+2022-10-10 at 08:54:45 | INFO | Retrieving datasets from COVID-19 Data Hub https://covid19datahub.io/
+2022-10-10 at 08:54:54 | INFO | Retrieving datasets from Our World In Data https://github.com/owid/covid-19-data/
+2022-10-10 at 08:54:56 | INFO | Retrieving datasets from Our World In Data https://github.com/owid/covid-19-data/
+2022-10-10 at 08:54:56 | INFO | Retrieving datasets from Our World In Data https://github.com/owid/covid-19-data/
@@ -533,7 +533,7 @@ 1-1. With DataE
-<covsirphy.engineering.engineer.DataEngineer at 0x7f0af2c26d60>
+<covsirphy.engineering.engineer.DataEngineer at 0x7fe409795df0>
We can get the all downloaded records as a pandas.DataFrame
with DataEngineer().all()
method.
@@ -553,37 +553,37 @@ 1-1. With DataE
<class 'pandas.core.frame.DataFrame'>
-RangeIndex: 226891 entries, 0 to 226890
+RangeIndex: 226961 entries, 0 to 226960
Data columns (total 27 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
- 0 ISO3 226891 non-null category
- 1 Province 226891 non-null category
- 2 City 226891 non-null category
- 3 Date 226891 non-null datetime64[ns]
- 4 Cancel_events 178070 non-null Float64
- 5 Confirmed 205577 non-null Float64
- 6 Contact_tracing 178013 non-null Float64
- 7 Country 224419 non-null string
- 8 Fatal 189037 non-null Float64
- 9 Gatherings_restrictions 178049 non-null Float64
- 10 Information_campaigns 178044 non-null Float64
- 11 Internal_movement_restrictions 178056 non-null Float64
- 12 International_movement_restrictions 178036 non-null Float64
- 13 Population 223427 non-null Float64
+ 0 ISO3 226961 non-null category
+ 1 Province 226961 non-null category
+ 2 City 226961 non-null category
+ 3 Date 226961 non-null datetime64[ns]
+ 4 Cancel_events 178167 non-null Float64
+ 5 Confirmed 205651 non-null Float64
+ 6 Contact_tracing 178109 non-null Float64
+ 7 Country 224493 non-null string
+ 8 Fatal 189110 non-null Float64
+ 9 Gatherings_restrictions 178146 non-null Float64
+ 10 Information_campaigns 178140 non-null Float64
+ 11 Internal_movement_restrictions 178153 non-null Float64
+ 12 International_movement_restrictions 178132 non-null Float64
+ 13 Population 223501 non-null Float64
14 Product 120930 non-null string
- 15 Recovered 71762 non-null Float64
- 16 School_closing 178088 non-null Float64
- 17 Stay_home_restrictions 177980 non-null Float64
- 18 Stringency_index 178005 non-null Float64
- 19 Testing_policy 178026 non-null Float64
- 20 Tests 86109 non-null Float64
- 21 Transport_closing 178042 non-null Float64
+ 15 Recovered 71763 non-null Float64
+ 16 School_closing 178185 non-null Float64
+ 17 Stay_home_restrictions 178077 non-null Float64
+ 18 Stringency_index 178102 non-null Float64
+ 19 Testing_policy 178122 non-null Float64
+ 20 Tests 86110 non-null Float64
+ 21 Transport_closing 178139 non-null Float64
22 Vaccinated_full 49138 non-null Float64
23 Vaccinated_once 51722 non-null Float64
24 Vaccinations 54420 non-null Float64
25 Vaccinations_boosters 27163 non-null Float64
- 26 Workplace_closing 178051 non-null Float64
+ 26 Workplace_closing 178148 non-null Float64
dtypes: Float64(21), category(3), datetime64[ns](1), string(2)
memory usage: 47.0 MB
@@ -624,8 +624,8 @@ 1-1. With DataE
-2022-10-10 at 08:18:06 | INFO | Retrieving COVID-19 dataset from https://github.com/lisphilar/covid19-sir/data/
-2022-10-10 at 08:18:07 | INFO | Retrieving datasets from COVID-19 Data Hub https://covid19datahub.io/
+2022-10-10 at 08:55:02 | INFO | Retrieving COVID-19 dataset from https://github.com/lisphilar/covid19-sir/data/
+2022-10-10 at 08:55:03 | INFO | Retrieving datasets from COVID-19 Data Hub https://covid19datahub.io/
@@ -816,7 +816,7 @@ 1-1. With DataE
-2022-10-10 at 08:18:10 | INFO | Retrieving datasets from COVID-19 Data Hub https://covid19datahub.io/
+2022-10-10 at 08:55:06 | INFO | Retrieving datasets from COVID-19 Data Hub https://covid19datahub.io/
@@ -1020,37 +1020,37 @@ 1-2. With DataD
<class 'pandas.core.frame.DataFrame'>
-RangeIndex: 226891 entries, 0 to 226890
+RangeIndex: 226961 entries, 0 to 226960
Data columns (total 27 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
- 0 ISO3 226891 non-null string
- 1 Province 226891 non-null string
- 2 City 226891 non-null string
- 3 Date 226891 non-null datetime64[ns]
- 4 Cancel_events 178070 non-null Float64
- 5 Confirmed 205577 non-null Float64
- 6 Contact_tracing 178013 non-null Float64
- 7 Country 224419 non-null string
- 8 Fatal 189037 non-null Float64
- 9 Gatherings_restrictions 178049 non-null Float64
- 10 Information_campaigns 178044 non-null Float64
- 11 Internal_movement_restrictions 178056 non-null Float64
- 12 International_movement_restrictions 178036 non-null Float64
- 13 Population 223427 non-null Float64
+ 0 ISO3 226961 non-null string
+ 1 Province 226961 non-null string
+ 2 City 226961 non-null string
+ 3 Date 226961 non-null datetime64[ns]
+ 4 Cancel_events 178167 non-null Float64
+ 5 Confirmed 205651 non-null Float64
+ 6 Contact_tracing 178109 non-null Float64
+ 7 Country 224493 non-null string
+ 8 Fatal 189110 non-null Float64
+ 9 Gatherings_restrictions 178146 non-null Float64
+ 10 Information_campaigns 178140 non-null Float64
+ 11 Internal_movement_restrictions 178153 non-null Float64
+ 12 International_movement_restrictions 178132 non-null Float64
+ 13 Population 223501 non-null Float64
14 Product 120930 non-null string
- 15 Recovered 71762 non-null Float64
- 16 School_closing 178088 non-null Float64
- 17 Stay_home_restrictions 177980 non-null Float64
- 18 Stringency_index 178005 non-null Float64
- 19 Testing_policy 178026 non-null Float64
- 20 Tests 86109 non-null Float64
- 21 Transport_closing 178042 non-null Float64
+ 15 Recovered 71763 non-null Float64
+ 16 School_closing 178185 non-null Float64
+ 17 Stay_home_restrictions 178077 non-null Float64
+ 18 Stringency_index 178102 non-null Float64
+ 19 Testing_policy 178122 non-null Float64
+ 20 Tests 86110 non-null Float64
+ 21 Transport_closing 178139 non-null Float64
22 Vaccinated_full 49138 non-null Float64
23 Vaccinated_once 51722 non-null Float64
24 Vaccinations 54420 non-null Float64
25 Vaccinations_boosters 27163 non-null Float64
- 26 Workplace_closing 178051 non-null Float64
+ 26 Workplace_closing 178148 non-null Float64
dtypes: Float64(21), datetime64[ns](1), string(5)
memory usage: 51.3 MB
@@ -2025,7 +2025,7 @@ 2-2. Convert line list to the number of cases data
-2022-10-10 at 08:19:35 | INFO | Retrieving GIS data from Natural Earth https://www.naturalearthdata.com/
+2022-10-10 at 08:56:38 | INFO | Retrieving GIS data from Natural Earth https://www.naturalearthdata.com/
@@ -2081,8 +2081,8 @@ 2-3. Retrieve total population data
-2022-10-10 at 08:19:38 | INFO | Retrieving datasets from World Population Prospects https://population.un.org/wpp/
-2022-10-10 at 08:19:44 | INFO | [INFO] 'Province' layer was removed.
+2022-10-10 at 08:56:42 | INFO | Retrieving datasets from World Population Prospects https://population.un.org/wpp/
+2022-10-10 at 08:56:47 | INFO | [INFO] 'Province' layer was removed.
@@ -515,37 +515,37 @@ 1. Data cleaning
<class 'pandas.core.frame.DataFrame'>
-RangeIndex: 226957 entries, 0 to 226956
+RangeIndex: 227027 entries, 0 to 227026
Data columns (total 27 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
- 0 ISO3 226957 non-null category
- 1 Province 226957 non-null category
- 2 City 226957 non-null category
- 3 Date 226957 non-null datetime64[ns]
- 4 Cancel_events 226957 non-null Float64
- 5 Confirmed 226957 non-null Float64
- 6 Contact_tracing 226957 non-null Float64
- 7 Country 226957 non-null object
- 8 Fatal 226957 non-null Float64
- 9 Gatherings_restrictions 226957 non-null Float64
- 10 Information_campaigns 226957 non-null Float64
- 11 Internal_movement_restrictions 226957 non-null Float64
- 12 International_movement_restrictions 226957 non-null Float64
- 13 Population 226957 non-null Float64
- 14 Product 226957 non-null object
- 15 Recovered 226957 non-null Float64
- 16 School_closing 226957 non-null Float64
- 17 Stay_home_restrictions 226957 non-null Float64
- 18 Stringency_index 226957 non-null Float64
- 19 Testing_policy 226957 non-null Float64
- 20 Tests 226957 non-null Float64
- 21 Transport_closing 226957 non-null Float64
- 22 Vaccinated_full 226957 non-null Float64
- 23 Vaccinated_once 226957 non-null Float64
- 24 Vaccinations 226957 non-null Float64
- 25 Vaccinations_boosters 226957 non-null Float64
- 26 Workplace_closing 226957 non-null Float64
+ 0 ISO3 227027 non-null category
+ 1 Province 227027 non-null category
+ 2 City 227027 non-null category
+ 3 Date 227027 non-null datetime64[ns]
+ 4 Cancel_events 227027 non-null Float64
+ 5 Confirmed 227027 non-null Float64
+ 6 Contact_tracing 227027 non-null Float64
+ 7 Country 227027 non-null object
+ 8 Fatal 227027 non-null Float64
+ 9 Gatherings_restrictions 227027 non-null Float64
+ 10 Information_campaigns 227027 non-null Float64
+ 11 Internal_movement_restrictions 227027 non-null Float64
+ 12 International_movement_restrictions 227027 non-null Float64
+ 13 Population 227027 non-null Float64
+ 14 Product 227027 non-null object
+ 15 Recovered 227027 non-null Float64
+ 16 School_closing 227027 non-null Float64
+ 17 Stay_home_restrictions 227027 non-null Float64
+ 18 Stringency_index 227027 non-null Float64
+ 19 Testing_policy 227027 non-null Float64
+ 20 Tests 227027 non-null Float64
+ 21 Transport_closing 227027 non-null Float64
+ 22 Vaccinated_full 227027 non-null Float64
+ 23 Vaccinated_once 227027 non-null Float64
+ 24 Vaccinations 227027 non-null Float64
+ 25 Vaccinations_boosters 227027 non-null Float64
+ 26 Workplace_closing 227027 non-null Float64
dtypes: Float64(21), category(3), datetime64[ns](1), object(2)
memory usage: 47.0+ MB
@@ -612,11 +612,11 @@ 2. Data transformation
- 226952
+ 227022
ZWE
-
-
- 2022-10-04
+ 2022-10-05
14439018.0
14181450.0
257568.0
@@ -625,24 +625,24 @@ 2. Data transformation82994.0
- 226953
+ 227023
ZWE
-
-
- 2022-10-05
+ 2022-10-06
14439018.0
- 14181450.0
- 257568.0
- 168971.0
- 5603.0
+ 14181363.0
+ 257655.0
+ 169057.0
+ 5604.0
82994.0
- 226954
+ 227024
ZWE
-
-
- 2022-10-06
+ 2022-10-07
14439018.0
14181363.0
257655.0
@@ -651,11 +651,11 @@ 2. Data transformation82994.0
- 226955
+ 227025
ZWE
-
-
- 2022-10-07
+ 2022-10-08
14439018.0
14181363.0
257655.0
@@ -664,11 +664,11 @@ 2. Data transformation82994.0
- 226956
+ 227026
ZWE
-
-
- 2022-10-08
+ 2022-10-09
14439018.0
14181363.0
257655.0
@@ -727,11 +727,11 @@ 2. Data transformation
- 226952
+ 227022
ZWE
-
-
- 2022-10-04
+ 2022-10-05
14439018
14181450.0
257568
@@ -740,24 +740,24 @@ 2. Data transformation82994.0
- 226953
+ 227023
ZWE
-
-
- 2022-10-05
+ 2022-10-06
14439018
- 14181450.0
- 257568
- 168971.0
- 5603.0
+ 14181363.0
+ 257655
+ 169057.0
+ 5604.0
82994.0
- 226954
+ 227024
ZWE
-
-
- 2022-10-06
+ 2022-10-07
14439018
14181363.0
257655
@@ -766,11 +766,11 @@ 2. Data transformation82994.0
- 226955
+ 227025
ZWE
-
-
- 2022-10-07
+ 2022-10-08
14439018
14181363.0
257655
@@ -779,11 +779,11 @@ 2. Data transformation82994.0
- 226956
+ 227026
ZWE
-
-
- 2022-10-08
+ 2022-10-09
14439018
14181363.0
257655
@@ -850,16 +850,7 @@ 3. Arithmetic operations
-