-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Benchmarks and efficiency improvements for MergedChoiceTable #11
Comments
You rang. :-P Do you have a reproducible test case and sample data I can use to profile? |
Good question :) The example that inspired this issue was from one of our lab projects, but it will be easier to work with the travel survey dataset that we've been using to put together demo notebooks. Instructions for loading the data are here: This notebook sets up equivalent data merges using All the underlying code is in this file: |
I did not have time to experiment with this before my vacation. Just from looking at the code without running it. For Other thoughts, I did not understand this removing perhaps we can find a clearer way to get the same functionality? |
My guess about those lines is that it's handling a case where invalid alternatives where included in the input data. Seems better to me to just enforce that all the inputs be valid. In general, the code we ported over from |
For large datasets,
MergedChoiceTable
is pretty slow.Example:
Can we speed this up? A quick investigation could look at:
How does the speed of
MergedChoiceTable
compare with the underlyingmnl_interaction_dataset()
?Which use cases and parts of the code cause the biggest bottlenecks?
How might we speed those things up?
The text was updated successfully, but these errors were encountered: