The model is based on the code of HRNet semantic segmentation with GCCPM pose estimation model keypoints refinemen stage. The core of the model used in the challenge is HRNetV2-w48 backbone.
The model predicts 57 keypoints, depicted below. The target tensor consisted of a 2D heatmap for each point, where Gaussian peaks were positioned at the keypoint locations. Locations of the keypoints, including the approach to refine lines intersections localtions and ellipses tangent points are computed using code in /src/datatools/.
EDA notebook contains visualization of the raw annotations and the derived keypoints, utilized in the model training process.
Ellipse notebook contains visualization of the ellipse-line intersection and ellipse tangent point calculation algorithms used in construction of the targets.
An additional target channel was included, which represented the inverse of the maximal value among the other target feature maps. This ensured that the final target tensor summed up to 1.0 at each spatial point.
The pitch pattern with all the points depicted. Red - line-line intersection, blue - line-conic intersection, purple - conic tangent point, dark-green - other points projected by homography.
The details on the model, training procedure and transforms are available in the blog post.
Papers: