Add Prediction Smoothing #696

yeldarby · 2023-12-27T21:09:18Z

Description

Adds a Smoother class which averages the last several predictions to decrease noise (like wobbly or flickering boxes).
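
As a rough illustration of the idea (not the code in this PR), averaging the last few boxes per tracked object could look like the sketch below. The class name `BoxAverager`, its `length` parameter, and the `update` method are hypothetical; they stand in for whatever the `Smoother` class actually exposes.

```python
from collections import defaultdict, deque


class BoxAverager:
    """Hypothetical sketch: average the last `length` boxes seen per tracker id."""

    def __init__(self, length=5):
        # One fixed-size history of (x1, y1, x2, y2) boxes per tracker id.
        self.history = defaultdict(lambda: deque(maxlen=length))

    def update(self, tracker_id, xyxy):
        # Store the newest box; the deque silently drops the oldest one when full.
        boxes = self.history[tracker_id]
        boxes.append(tuple(float(v) for v in xyxy))
        # Average each coordinate over the stored boxes.
        return tuple(sum(coord) / len(boxes) for coord in zip(*boxes))
```

A larger `length` gives steadier boxes at the cost of more lag behind fast-moving objects.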

Before & after smoothing:

smoothed-grocery-example-480.mp4

Type of change

New feature (non-breaking change which adds functionality)

How has this change been tested? Please provide a test case or example of how you tested the change.

Installed locally & tested on several videos/models with InferencePipeline.

Any specific deployment considerations

No.

Docs

Docs updated? What were the changes: Added new docs page with video & example code.

…pervision into prediction-smoothing

SkalskiP · 2023-12-28T11:00:52Z

supervision/detection/smoother.py

+    > _On the left are the model's raw predictions,
+    > on the right is the output of Smoother._
+
+    ## Example Usage:


Can we show a more general example? I assume Smoother can work with any object detection model.

Sure, what do you have in mind? I figured person detection with COCO using one of the videos from assets was about as general as it gets to demonstrate the functionality but we can change it to something else.

Good candidate videos to communicate the functionality would have:

- A small number of objects moving fairly subtly (otherwise it's really distracting trying to understand what you're supposed to be comparing side by side)
- Prediction jitter coming from the model

For example, while it works with vehicles-2.mp4, that's not a good example video: there's too much going on and the objects are moving too fast to grok what's happening at the detection-by-detection level.

SkalskiP · 2023-12-28T16:59:09Z

@yeldarby, please let me know once the PR is ready for the next review round.

…pervision into prediction-smoothing

yeldarby · 2023-12-28T18:23:48Z

PR is updated in response to your comments here & @capjamesg's comments on Slack.

SkalskiP · 2023-12-29T11:41:41Z

@yeldarby I started the second round of review, but I got confused and decided to ask here:

Is tracker_length still needed? Because it is not used, it is not documented and generally unrelated to the Smoother task (smoothing boxes).
If tracker_length is not needed, do we need self.track_starts and self.track_ends? self.track_starts is only there to support tracker_length, and self.track_ends is used in two places, but the second one can easily be implemented differently.
If we don't need self.track_starts and self.track_ends we probably do not need self.current_frame.

Overall, the logic gets a lot simpler if we drop tracker_length, which, as I said, is not used, is not documented, and is generally unrelated to the Smoother task.

yeldarby · 2023-12-29T14:53:50Z

> Is tracker_length still needed?

I'm using it in some client code to create animated Annotators. The frame of the animation for a detection needs to be tracker_length % num_frames.
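
To make the modulo concrete, here is a small sketch. `animation_frame` and its arguments are hypothetical names, and `tracker_length` is assumed to mean the number of frames a detection has been tracked so far.

```python
def animation_frame(tracker_length: int, num_frames: int) -> int:
    """Pick which animation frame to draw for a detection (hypothetical helper)."""
    if num_frames <= 0:
        raise ValueError("num_frames must be positive")
    # The animation loops: after the last frame it wraps back to frame 0.
    return tracker_length % num_frames
```

Because the index depends only on how long each object has been tracked, every detection animates on its own clock rather than on the global frame counter.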

I'm also planning to use it to let people choose whether to wait until a detection has been seen some number of times before displaying it, so that something detected in a frame or two but never again doesn't flicker onto the screen (I haven't implemented that here yet).

> Because it is not used, it is not documented

Good call; if we keep it here I'll document it.

> and generally unrelated to the Smoother task (smoothing boxes).

The entry delay will be related to this (though that's easier since you should be able to just take the length of the non-None detections from the tracker; it'd be a tiny bit slower to sum vs reference a count but that's negligible given length will typically be small).
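
The entry delay described here might be sketched as follows, assuming a track's history is kept as a sequence in which frames without a match are stored as `None`. `should_display` and `min_hits` are hypothetical names and are not part of this PR.

```python
from typing import Optional, Sequence


def should_display(track_history: Sequence[Optional[object]], min_hits: int = 3) -> bool:
    """Hypothetical entry-delay check based on a track's recent history."""
    # Count the frames in which the object was actually detected (non-None entries).
    hits = sum(1 for detection in track_history if detection is not None)
    # Only show the detection once it has been seen often enough.
    return hits >= min_hits
```

Counting on each call is linear in the history length, which stays cheap as long as the history is short.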

But yeah I think this probably makes a bit more structural sense at the Tracker level (even in my primary use-case you should be able to have an animated Annotator w/o using Smoother); want me to move it there?

yeldarby · 2023-12-29T17:42:47Z

Updated to remove tracker_length and track_starts.

SkalskiP · 2023-12-29T08:48:32Z