The idea is to take a video with people walking on a street, detect people, measure distance between them and if the distance between two persons is less than a certain threshold draw red rectangles around each of them.
Detectron2 is used to detect people on each frame. People are detected each frame separately, not tracking.
Faster R-CNN is the model used for object detection.
Runs only on Google colab