-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Machine Learning Punch List for Launch #110
Comments
Once we are happy with the final notebook, we should do a final cleanup (there is still a warning in the import section) and then I will do a PR into |
@rdvelazquez @dhimmel @wisygig Because the launch party is coming up, I was going to open a PR to push the notebook to production this week/weekend. I think Ryan has made very good progress, and our focus should probably be put towards making sure that his changes work on our AWS instances. |
I agree & fully support! |
Sounds good to me! |
The larger query for TP53 as the gene and all diseases selected runs into a memory error. |
Okay @kurtwheeler and I will discuss our options tomorrow and let you know what we're thinking. |
Any updates on this? Should we reduce the hyperparameter search space? We could probably cut it in half, by looking at alpha for every 0.2 instead of 0.1. |
I'm hoping to get to this this afternoon. What I'm thinking is run the pathological query (TP53 as the gene and all diseases selected) locally and see how much memory is consumed. I would use this technique which @yl565 previously implemented. This will let us know whether our AWS image size is too small or there is another issue where memory isn't fully allocated.
Does this affect memory usage now that we're using |
Oh, I'm not sure, I just assumed that's where the memory burden still was. I think @rdvelazquez would have a better answer. |
I assume that the hyperparameter space affects memory usage. I'm not positive but I don't see how it wouldn't. I also assume that the number of PCA components has a much greater affect on memory than alpha_range so we may also need to evaluate the number of PCA components. Once we have the notebook code set-up to be profiled we can easily adjust the hyperparameter space and re-profile to quantify how changes to the hyperparameter space affects memory (if at all). |
So I installed memory profiler ( Increasing
So we can keep at |
I've changed the cognoma EC2 instances from |
Me too! https://api.cognoma.org/diseases/ returns a 503 code.
@kurtwheeler and I will look into what failed tomorrow! |
@rdvelazquez https://api.cognoma.org should now be back up. @kurtwheeler fixed it this morning. We had changed the instance type, but had not destroyed and recreated the instances (which ECS apparently requires). |
It's pretty fast too! |
Here's the general punch list that we discussed at tonight's meetup for getting the machine learning part of cognoma launch ready.
To be completed at a later date: Templating for jupyter notebooks (@wisygig)
The text was updated successfully, but these errors were encountered: