
Is MinVIS truly online? #2

Open

timmeinhardt opened this issue Aug 30, 2022 · 2 comments

Comments

timmeinhardt commented Aug 30, 2022

First of all, congratulations on this paper. It was a very interesting read. However, I think MinVIS technically cannot be considered an online method. You process each frame separately, but an online method must not use information from future frames for the decision making on the current frame. In this line

out_logits = sum(out_logits)/len(out_logits)

you compute mean scores for each query and class across the entire sequence. These scores are later used for the top-k selection of the final outputs. While your frame processing might be online, using information from all frames at once means your decision making is not. Please clarify what I might be misunderstanding, or share your point of view on the matter. Thank you!
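For reference, a minimal sketch of the decision making I am referring to, assuming the usual Mask2Former-style [num_queries, num_classes + 1] logits per frame (not the exact MinVIS code):

import torch
import torch.nn.functional as F

def select_tracks_offline(out_logits_per_frame, num_topk=10):
    """out_logits_per_frame: list of [num_queries, num_classes + 1] tensors,
    one entry per frame of the clip."""
    # sequence-level average -> one score per (query, class) pair,
    # so the selection for every frame depends on future frames
    out_logits = sum(out_logits_per_frame) / len(out_logits_per_frame)
    scores = F.softmax(out_logits, dim=-1)[:, :-1]   # drop the "no object" class
    topk_scores, topk_idx = scores.flatten().topk(num_topk)
    query_idx = torch.div(topk_idx, scores.shape[1], rounding_mode="floor")
    class_idx = topk_idx % scores.shape[1]
    return topk_scores, query_idx, class_idx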

JialianW commented Sep 1, 2022

To my understanding, this is only for evaluation purposes. VIS requires each tracklet to have a score in order to compute mAP, and the most straightforward way to obtain a score for a tracklet is to average across frames. I don't think this part would be used in real streaming applications.
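To illustrate, a minimal sketch of that evaluation-only scoring, assuming a result entry roughly in the YouTube-VIS submission format (hypothetical helper, not the MinVIS code):

def tracklet_result(video_id, category_id, per_frame_scores, per_frame_masks):
    # One entry per tracklet; the only sequence-level quantity needed is the
    # single "score" field required by the mAP evaluation.
    return {
        "video_id": video_id,
        "category_id": category_id,
        "score": sum(per_frame_scores) / len(per_frame_scores),  # frame average
        "segmentations": per_frame_masks,  # one mask (or None) per frame
    }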

timmeinhardt (Author) commented Sep 1, 2022

I agree that the GT and prediction file design of YouTube-VIS and OVIS both invite processing their data in an offline fashion, but it is possible to generate the required full-sequence tracks even with truly online methods. For example, IDOL not only processes sequences online but also never uses information from future frames for the mask/score prediction of the current frame. This requires full frame-to-frame track management, and to satisfy the VIS GT format missing/occluded frames are filled in retroactively with zeros (see here).

In a real-world streaming application such online track management would also be necessary. You cannot apply your current method to a video stream and produce reasonable tracks once objects get occluded or enter/leave the sequence. Most importantly, when it comes to the evaluation and comparability of methods, computing scores over the full sequence is usually considered offline. Using global information/averaging would probably benefit IDOL as well. Hence, I think the comparison is not fair.
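To make the distinction concrete, a sketch of one possible truly online variant: the (query, class) decision for frame t uses only logits seen up to frame t (a causal running mean here), and frames where a track is missing are filled in afterwards so the full-sequence VIS result format is still satisfied. Hypothetical helper names; this is neither the IDOL nor the MinVIS implementation.

import torch
import torch.nn.functional as F

def online_topk(running_sum, num_frames_seen, frame_logits, num_topk=10):
    """Update a causal running mean of the logits and select top-k tracks
    using only information available at the current frame."""
    running_sum = running_sum + frame_logits
    mean_logits = running_sum / num_frames_seen
    scores = F.softmax(mean_logits, dim=-1)[:, :-1]   # drop the "no object" class
    topk_scores, topk_idx = scores.flatten().topk(num_topk)
    query_idx = torch.div(topk_idx, scores.shape[1], rounding_mode="floor")
    class_idx = topk_idx % scores.shape[1]
    return running_sum, topk_scores, query_idx, class_idx

def pad_missing_frames(track_masks_by_frame, num_frames):
    """Retroactively fill frames where the track was not detected with None,
    so the submitted tracklet covers the whole sequence."""
    return [track_masks_by_frame.get(t, None) for t in range(num_frames)]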
