-
Notifications
You must be signed in to change notification settings - Fork 66
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ensemble.fit gets stuck #18
Comments
I think I have run into the same problem previously. I meant to open an issue before now! I think the issue is caused by a divide by zero creating Nans in line 37 and 38 of diagnostics.kruskal_align. Since it is always 0/0 that is being computed at that stage I fixed it with a kluge resetting Nans to 0.
Hope this helps! @ahwillia thanks for everything! Sorry for not posting this sooner! |
So this happens when a factor becomes fully zero? Perhaps the right way to fix this would be to be to drop that factor out the model. Effectively, the model would have rank |
Here is my fix -- ahwillia@1014e53. Let me know if this works for you. If the problem persists we can re-open the issue. |
Thanks! I think this is a good solution. The only thing I noticed from my experience was that after training you never ended up with a zero component. Is it possible for a factor to go to zero transiently? It has been a long time since I actually had to deal with this. |
Ideally the factor should never go to zero. But depending on the
optimization algorithm I think it might.
…On Mon, Sep 9, 2019, 1:32 PM Kelly McGuire ***@***.***> wrote:
Thanks! I think this is a good solution.
The only thing I noticed from my experience was that after training you
never ended up with a zero component. Is it possible for a factor to go to
zero transiently? It has been a long time since I actually had to deal with
this.
—
You are receiving this because you modified the open/close state.
Reply to this email directly, view it on GitHub
<https://github.com/ahwillia/tensortools/issues/18?email_source=notifications&email_token=AAE3NUKIXLJIVMCLF3FOSATQI2XEPA5CNFSM4H26MSZKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD6I6AHA#issuecomment-529653788>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAE3NUMCP2ZHNAP2SAQRUODQI2XEPANCNFSM4H26MSZA>
.
|
For me when fitting ensembles this now breaks in a different way. Seems like removing a zero factor causes a problem with the permutation.
|
Thanks for helping me find and fix these corner cases... New commit here should hopefully fix the error you were getting: ahwillia@b9c4fef Can you |
I am fitting an ensemble of tensor decompositions (
ncp_bcp
) on my data, however, on some sessions, the call tofit
gets stuck after fitting models of all ranks. Model fitting seems to take the same amount of time on all sessions (which is expected as they all have comparable numbers of neurons/trials), but on some, whatever happens after model fitting (component aligment?) takes a huge amount of time. Terminating execution manually:It could be the case that for the sessions on which the gets stuck the data tensor is a little bit bigger, but in practice still really small, e.g. 23 by 96 by 247 (neurons x time points x trials).
This is also related to the rank of the ensemble I am fitting. For the example tensor of size (23, 96, 247), if I fit an ensemble up to rank 14, it runs fine, if I push it to rank 16 it gets stuck.
Do you have any ideas why the time required for the aligment would blow up under certain conditions and on how to avoid it?
The text was updated successfully, but these errors were encountered: