Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some dwell time timestamps are missing or out of sequence #206

Open
glciampaglia opened this issue Aug 7, 2023 · 3 comments
Open

Some dwell time timestamps are missing or out of sequence #206

glciampaglia opened this issue Aug 7, 2023 · 3 comments
Assignees
Labels
backend Issues related to the backend of Rockwell bug Something isn't working frontend Issues related to the frontend of Rockwell

Comments

@glciampaglia
Copy link
Collaborator

glciampaglia commented Aug 7, 2023

From the first pilot with Rockwell endless feed (which we ran in August 2023) we noticed one participant had a single missing timestamp. For now, we think this due to data being dropped by the browser of the participant (for example if browser has too many tabs open, etc.), so we will not do anything about it. If this issue appears again, we might discuss it.

@glciampaglia glciampaglia added this to the Utopia milestone Aug 7, 2023
@saumyabhadani95
Copy link
Collaborator

Actually around 6-7 participants have missing timestamps (from manual analysis). Also, the missing timestamps might also be causing the timestamps to not increase monotonically (timestamp of tweet with rank r is sometimes greater than timestamp with rank r + 1 More description in Issue 2 in this link)

The most probable reason for this would be that the rank of the tweet is not correctly being calculated on the frontend. For example, its possible that when tweet with rank r is in view, the frontend incorrectly calculates that the rank of the tweet as dr which can be r+1 or r-1

To solve this, we can reduce the calculation on the frontend side and try to get more raw data. The rank of "tweet in view" is calculated using different heights like total height of the feed and height of individual tweet cards, etc. So one possible way to get the most accurate data would be to get these different heights for every scroll.

@glciampaglia
Copy link
Collaborator Author

We discussed this issue internally within UMD team, and we decided that we will report this finding back to the rest of the team and discuss with them further what to do. If the analysis we want to use these data for can be performed with some missing data, then we should be OK with having some of missing/inaccurate timestamps. If instead we need 100% of all timestamps to be exactly correct, then we could discuss ways to reduce the chances of either problems, for example recording the raw position data used to determine the index of the tweet in view and/or move some computation from the participants' browser to the server.

@glciampaglia glciampaglia added bug Something isn't working frontend Issues related to the frontend of Rockwell backend Issues related to the backend of Rockwell labels Aug 14, 2023
@glciampaglia glciampaglia changed the title Missing timestamps Some timestamps are missing or out of sequence. Aug 14, 2023
@glciampaglia glciampaglia changed the title Some timestamps are missing or out of sequence. Some timestamps are missing or out of sequence Aug 14, 2023
@glciampaglia
Copy link
Collaborator Author

We discussed the issue again, this time with the full team. The consensus was that it is OK to have some missing or inaccurate data, as long as we are aware that when we analyze them, we need to "repair" the data. This can be done in two ways:

  1. For tweet ranks that have no timestamp, we simply take the timestamp of the two closest tweets, and set the timestamp as the average of the two.
  2. For tweet ranks where the timestamp is out of sequence, we apply the same idea as above, but now the average timestamp is prepended to the out-of-sequence timestamp.

Finally, we decided that we will not make any further modification to Robert's code.

@glciampaglia glciampaglia changed the title Some timestamps are missing or out of sequence Some dwell time timestamps are missing or out of sequence Sep 11, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backend Issues related to the backend of Rockwell bug Something isn't working frontend Issues related to the frontend of Rockwell
Projects
None yet
Development

No branches or pull requests

2 participants