Map inferred_user_id in snowplow_page_views #60

Open
jtcohen6 opened this issue May 24, 2019 · 1 comment

@jtcohen6
Contributor

Rationale

The Snowplow package has two first-class outputs: snowplow_sessions and snowplow_page_views. Both should include the stitched user identity, inferred_user_id, that is the product of snowplow_id_map.

We frequently find ourselves building reports on top of both tables, and we should be able to perform counts of distinct visitors that agree between them.
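
A minimal sketch of what "counts that agree" could look like, using an in-memory SQLite database rather than the package's real models. The simplified schemas and the column names (`domain_userid`, `user_id`) are assumptions for illustration, as is the `coalesce` fallback to the cookie id when no mapping exists; none of this is taken from the package source.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical, simplified schemas; column names are assumptions.
    create table snowplow_id_map (domain_userid text primary key, user_id text);
    create table snowplow_page_views (page_view_id text, session_id text, domain_userid text);
    create table snowplow_sessions (session_id text, domain_userid text);

    -- Two cookies stitched to one known user, plus one anonymous cookie.
    insert into snowplow_id_map values ('cookie_a', 'user_1'), ('cookie_b', 'user_1');
    insert into snowplow_page_views values
        ('pv1', 's1', 'cookie_a'),
        ('pv2', 's2', 'cookie_b'),
        ('pv3', 's3', 'cookie_c');
    insert into snowplow_sessions values
        ('s1', 'cookie_a'), ('s2', 'cookie_b'), ('s3', 'cookie_c');
""")


def distinct_visitors(table):
    """Count distinct inferred visitors in one of the two demo tables."""
    # The table name comes only from the fixed calls below, never from user input.
    query = f"""
        select count(distinct coalesce(m.user_id, t.domain_userid))
        from {table} as t
        left join snowplow_id_map as m on m.domain_userid = t.domain_userid
    """
    return conn.execute(query).fetchone()[0]


page_view_visitors = distinct_visitors("snowplow_page_views")
session_visitors = distinct_visitors("snowplow_sessions")

# Counting raw cookies instead would overstate the number of visitors.
raw_cookie_count = conn.execute(
    "select count(distinct domain_userid) from snowplow_page_views"
).fetchone()[0]
```

With the same stitched identity on both tables, the two visitor counts match, while the raw cookie count is higher because `cookie_a` and `cookie_b` belong to the same person.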

@jtcohen6 jtcohen6 added this to the 0.8.0 milestone May 24, 2019
@drewbanin
Collaborator

Really good idea! Can we additionally try to set the user_id field more appropriately for snowplow_page_views? Right now, it's only set if the user's persistent id was known at the time of that page view. If we're going to join to the id map, are we also able to set the snowplow_user_custom_id field if it was known in the scope of the session?
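
One way to read the session-scoped idea, sketched against an in-memory SQLite table. The table name `page_views`, the column `user_custom_id`, and the use of `max()` to pick a single id per session are all assumptions for illustration, not the package's implementation; a session that carries more than one custom id would need a deliberate tie-breaking rule.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical table and column names.
    create table page_views (page_view_id text, session_id text, user_custom_id text);
    insert into page_views values
        ('pv1', 's1', null),      -- viewed before the user identified themselves
        ('pv2', 's1', 'user_1'),  -- same session, id now known
        ('pv3', 's2', null);      -- session in which the id was never known
""")

# Backfill each page view with the custom id seen anywhere in its session.
rows = conn.execute("""
    select
        pv.page_view_id,
        coalesce(pv.user_custom_id, s.session_user_custom_id) as user_custom_id
    from page_views as pv
    left join (
        -- max() ignores nulls, so this yields the known id if the session has one
        select session_id, max(user_custom_id) as session_user_custom_id
        from page_views
        group by session_id
    ) as s on s.session_id = pv.session_id
    order by pv.page_view_id
""").fetchall()
```

Here `pv1` picks up `user_1` from the later page view in the same session, and `pv3` stays null because nothing in its session identified the user.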

@jtcohen6 jtcohen6 modified the milestones: 0.8.0, 0.9.0 Oct 21, 2019
@jtcohen6 jtcohen6 modified the milestones: 0.9.0, 0.10.0 Mar 27, 2020
@jtcohen6 jtcohen6 removed this from the 0.11 milestone Jun 12, 2020