This is a repository for publicly available anonymized and pseudo-anonymized mobility data from the ohmage Mobility Android app (source, app). The Mobility app collects a data point every minute or every five minutes, depending on user preference.
The data is available as CSV exports. There is a pseudo-anonymized example here and an anonymized example here.
The data can be found in the data directory of this repo. Each directory represents one user's data. The data has been cleaned and is valid CSV and JSON, but irregularities may exist. Each directory contains gzipped tar archives, each holding a chunk of data that covers a period of one to six months. Time periods with large gaps have been removed. Each archive is roughly 50 MB or less.
If you have questions, feel free to open an issue. Otherwise, clone this repo and analyze away!
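As a starting point, here is a minimal sketch of reading rows straight out of one of the gzipped tar archives without unpacking it to disk. It assumes that each archive contains files ending in `.csv`, that they are UTF-8 encoded, and that each has a header row with the field names described below; none of this is guaranteed by the repo, so adjust as needed. The function name `iter_csv_rows` and the path in the usage comment are hypothetical.

```python
import csv
import io
import tarfile


def iter_csv_rows(archive):
    """Yield (member_name, row_dict) for every CSV file in an open tar archive."""
    for member in archive:
        # Skip directories and anything that is not a CSV file (assumed extension).
        if not member.isfile() or not member.name.endswith(".csv"):
            continue
        raw = archive.extractfile(member)
        # Decode as UTF-8 text (assumed); newline="" is what the csv module expects.
        text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
        for row in csv.DictReader(text):
            yield member.name, row


# Hypothetical usage; the path is a placeholder, not a real file name.
# with tarfile.open("data/<user>/<archive>.tar.gz", "r:gz") as archive:
#     for name, row in iter_csv_rows(archive):
#         print(name, row.get("mode"))
```

Every value arrives as a string, so numeric fields such as `time` need an explicit conversion before use.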
id | An integer identifier that is unique among the mobility data points in the same list. |
uuid | A unique identifier for this point, like id, but expressed as a UUID. |
time | The time at which this point was created, in milliseconds since the Unix epoch (January 1, 1970, 00:00 UTC). |
time_offset | The offset, in milliseconds, that converts time to local time in the timezone where the point was collected. |
timezone | The human-readable "zoneinfo" timezone. May be null. |
location_timestamp | The timestamp at which the location was determined, which may differ from the time the point was created. May be null. |
latitude and longitude | The location of the user at the time of location_timestamp. May be null. |
location_accuracy | This value may be null, but if it isn't, the Android developer docs state the following.
|
location_provider | May be null. The system entity that supplied the location. The possible values are
You can check out the source code for WiFiGPSLocation here. Note that other values are possible for this field, but the list above contains the values that are officially supported. |
wifi_data | A list of Wi-Fi access points and the time at which they were collected. A typical list contains between 5 and 10 access points, though it can be larger. |
scan | A list of ssid-strength pairs. Strength is a negative integer; values closer to zero indicate a stronger signal. |
speed | The speed (not velocity) of the mobile device compared to the previous GPS reading. Can be zero if obtaining a GPS lock was unsuccessful. |
accel_data | A list of accelerometer readings. The list is typically between 30 and 50 triaxial points. |
mode | The ambulatory mode assigned to this point by our classification algorithm. The value will be one of still, walk, run, drive, or error. The error mode occurs if the classifier cannot determine the mode from the data for the point. |
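
To illustrate how a few of these fields might be used together, here is a rough sketch rather than a reference implementation. It assumes that `time_offset` is added to UTC to obtain local time, and that each entry in `scan` is a mapping with `ssid` and `strength` keys; the table above only says "ssid-strength pairs", so the key names are a guess. The helper names `local_time`, `strongest_access_point`, and `count_modes` are hypothetical.

```python
from collections import Counter
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def local_time(time_ms, time_offset_ms):
    """Convert the time and time_offset fields to a timezone-aware local datetime.

    Assumes time_offset is the number of milliseconds to add to UTC.
    """
    utc = EPOCH + timedelta(milliseconds=int(time_ms))
    local_zone = timezone(timedelta(milliseconds=int(time_offset_ms)))
    return utc.astimezone(local_zone)


def strongest_access_point(scan):
    """Return the scan entry whose strength is closest to zero, or None if empty.

    Assumes each entry looks like {"ssid": ..., "strength": <negative int>}.
    """
    if not scan:
        return None
    return max(scan, key=lambda entry: entry["strength"])


def count_modes(points):
    """Count how many points fall into each mode (still, walk, run, drive, error)."""
    return Counter(point["mode"] for point in points)
```

For example, `local_time(0, 3600000)` yields 01:00 on January 1, 1970 at UTC+01:00. Because `timezone` and `time_offset` describe the same thing in two ways, comparing them is one cheap way to spot the irregularities mentioned earlier.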