Hi,
Thanks for your work. I was wondering how you deal with gradient updates on the non-searchable stages of the model.
The searchable layers will only be updated once, but multiple forward and backward passes would then go through the tail/stem and the detection head. Do you average the gradients, or do you freeze the parameters of the non-searchable stages?
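
To make the two options concrete, here is a minimal sketch of what I mean, in plain Python with a toy scalar model. None of this is taken from the repository: the function `train_step`, the `mode` flag, and the model `y = head * choice * stem * x` are hypothetical stand-ins for a shared stem, one searchable layer with several candidates, and a shared head.

```python
def train_step(stem, head, choices, x, target, lr=0.1, mode="average"):
    """One training step over every candidate path of a toy scalar supernet.

    Model per path k: y = head * choices[k] * stem * x
    Loss per path:    0.5 * (y - target) ** 2

    mode="average": shared gradients (stem, head) are averaged over paths.
    mode="freeze":  shared parameters are left untouched.
    """
    stem_grad = 0.0
    head_grad = 0.0
    new_choices = list(choices)

    for k, c in enumerate(choices):
        err = head * c * stem * x - target      # dLoss/dy for this path
        # Shared parameters receive one gradient contribution per path.
        stem_grad += err * head * c * x
        head_grad += err * c * stem * x
        # Each searchable candidate sees exactly one backward pass.
        new_choices[k] = c - lr * err * head * stem * x

    if mode == "average":
        n = len(choices)
        stem = stem - lr * stem_grad / n
        head = head - lr * head_grad / n
    elif mode != "freeze":
        raise ValueError(f"unknown mode: {mode}")

    return stem, head, new_choices


if __name__ == "__main__":
    print(train_step(1.0, 1.0, [1.0, 2.0], x=1.0, target=1.0, mode="average"))
    print(train_step(1.0, 1.0, [1.0, 2.0], x=1.0, target=1.0, mode="freeze"))
```

In the "average" case the stem and head take a single step using the mean of the per-path gradients, while in the "freeze" case only the candidates move. A third possibility, summing the gradients without dividing, would give the shared parameters a step that grows with the number of sampled paths.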