Implementation of algorithm one from the paper #8
base: master
Conversation
src/omniglot/wrapper.py
Outdated
# This line breaks gradient computation for now:
# the meta_layers' requires_grad properties are set to False if
# we call init_adaptation.
# self.model.init_adaptation()
Calling self.model.init_adaptation() produces an error when calling backward() at the end of each meta_batch, since it sets the meta_layers' requires_grad properties to False. We may need to freeze/unfreeze the meta_layers in a more controlled way.
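A minimal sketch of one way to do this freezing and unfreezing in a controlled way, assuming meta_parameters() returns the meta layers' parameters as in the snippets below; the frozen() helper is hypothetical and only illustrates restoring requires_grad after adaptation so the final backward() still sees trainable meta layers:

from contextlib import contextmanager

@contextmanager
def frozen(params):
    # Temporarily set requires_grad=False on the given parameters,
    # restoring the original flags on exit.
    params = list(params)
    saved = [p.requires_grad for p in params]
    try:
        for p in params:
            p.requires_grad_(False)
        yield
    finally:
        for p, flag in zip(params, saved):
            p.requires_grad_(flag)

# Hypothetical usage inside the wrapper:
# with frozen(self.model.meta_parameters()):
#     self.model.init_adaptation()
#     ...run the inner adaptation steps...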
src/omniglot/wrapper.py
Outdated
if meta_train:
    # At the end of collecting K steps for N tasks we do the
    # backward pass.
    backward(self.meta_loss, self.model.meta_parameters(
        include_init=False))
    self._final_meta_update()
Once we have collected k inner iterations for N tasks, we can call the backward pass to compute the gradients.
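A minimal, self-contained sketch of this accumulate-then-backward pattern, assuming the accumulated meta loss is a single differentiable scalar; the meta_backward helper is hypothetical and is not necessarily how the backward call used above is implemented:

import torch

def meta_backward(meta_loss, meta_params):
    # Compute gradients of the accumulated meta loss with respect to the
    # meta parameters only, and add them to the .grad fields.
    grads = torch.autograd.grad(meta_loss, meta_params, allow_unused=True)
    for p, g in zip(meta_params, grads):
        if g is not None:
            p.grad = g if p.grad is None else p.grad + g

# Mirroring the block above: after K steps over N tasks, call
# meta_backward(self.meta_loss, list(self.model.meta_parameters(include_init=False)))
# followed by self._final_meta_update().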
src/omniglot/wrapper.py
Outdated
if meta_train:
    opt = SGD(self.model.optimizer_parameter_groups(tensor=True))
    opt.zero_grad()
    outer_input, outer_target = next(iter(batches))
    l_outer, (l_inner, a1, a2) = step(
        criterion=self.criterion,
        x_inner=inner_input, x_outer=outer_input,
        y_inner=inner_target, y_outer=outer_target,
        model=self.model,
        optimizer=opt, scorer=None)
    self.meta_loss = self.meta_loss + l_outer
    del l_inner, a1, a2
These lines calculate the outer loss at each state of the model parameters \theta_{k}^{\tau}. However, I am not sure how we should handle freezing and unfreezing the meta and adaptation layers.
According to the pseudocode, the gradients of \theta_{0} must be collected using \theta_{0:k}^{\tau}. How should we implement this correctly?
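One common way to let the gradients of \theta_{0} flow through the whole trajectory \theta_{0:k}^{\tau} is to keep the inner updates differentiable, carrying the adapted parameters as tensors derived from \theta_{0} instead of mutating the model in place. Below is a minimal sketch under that assumption; functional_forward is a hypothetical helper that runs the model with an explicit parameter list, and this is not how the current wrapper works:

import torch

def unrolled_meta_loss(theta0, inner_batches, outer_batch,
                       functional_forward, criterion, inner_lr=0.1):
    # Run k differentiable inner SGD steps starting from theta_0 and
    # accumulate the outer loss at every intermediate state, so that
    # backward() on the result reaches theta_0 through theta_{0:k}.
    fast = list(theta0)                      # theta_0
    x_out, y_out = outer_batch
    meta_loss = 0.0
    for x_in, y_in in inner_batches:         # k inner steps
        inner_loss = criterion(functional_forward(fast, x_in), y_in)
        grads = torch.autograd.grad(inner_loss, fast, create_graph=True)
        fast = [p - inner_lr * g for p, g in zip(fast, grads)]   # theta_{i+1}
        meta_loss = meta_loss + criterion(functional_forward(fast, x_out), y_out)
    return meta_loss   # meta_loss.backward() populates .grad on theta_0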
# init_objective = INIT_OBJECTIVES[self.init_objective]
# init_objective(model.named_init_parameters(suffix=None),
#                params, self.norm, self.bsz, step_fn)
pass
I have commented out the initialization objective for now. Should we also use leap-based initialization for online learning?
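If an initialization objective is wanted for the online setting, one heavily simplified option is to pull the initialization toward the parameters reached at the end of each task's inner loop. The sketch below is a Reptile-style simplification for illustration, not the actual leap objective from the repository:

import torch

@torch.no_grad()
def pull_init_toward_trajectory(init_params, adapted_params, step_size=0.1):
    # theta_0 <- theta_0 + eps * (theta_K - theta_0): a Reptile-style
    # simplification of pulling the initialization along the task trajectory.
    for p0, pk in zip(init_params, adapted_params):
        p0.add_(step_size * (pk.detach() - p0))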
This PR is the initial effort toward implementing Algorithm 1 for online learning using WarpGrad. I started by analysing the implementation of Algorithm 2. Since the online learning algorithm does not require storing datapoints and model states in a buffer, I have reused the step function from warpgrad.utils inside the inner training loop.

Summary of changes:
- Whether the leap-based initialization from warpgrad.utils should also be applied for online learning is still an open question.
- The step function from warpgrad.utils is called inside the run_batches function of the wrapper class for each of the k inner updates (see the sketch after this list).
- The outer losses are accumulated in the meta_loss property of the wrapper class.
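For reference, here is a simplified outline of how the pieces described above could fit together inside run_batches. It is assembled from the snippets in this thread (step, SGD over optimizer_parameter_groups, meta_loss, backward, _final_meta_update), with the surrounding control flow filled in as an assumption: batches is assumed to be a list of (input, target) pairs, the names SGD, step, and backward are assumed to be imported as in the original file, and this is a sketch, not the actual wrapper code.

def run_batches(self, batches, meta_train=True):
    # Simplified outline: k inner updates per task, accumulating the outer
    # loss into self.meta_loss, followed by one meta backward pass.
    self.meta_loss = 0.0
    for inner_input, inner_target in batches:                # k inner steps
        if meta_train:
            opt = SGD(self.model.optimizer_parameter_groups(tensor=True))
            opt.zero_grad()
            outer_input, outer_target = next(iter(batches))  # outer batch for the meta loss
            l_outer, (l_inner, a1, a2) = step(
                criterion=self.criterion,
                x_inner=inner_input, x_outer=outer_input,
                y_inner=inner_target, y_outer=outer_target,
                model=self.model, optimizer=opt, scorer=None)
            self.meta_loss = self.meta_loss + l_outer
            del l_inner, a1, a2
    if meta_train:
        backward(self.meta_loss,
                 self.model.meta_parameters(include_init=False))
        self._final_meta_update()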