Landmark detection by ResNet / Hourglass Model

The final goal of the project is to extact noseprints from dogs with Deep Learning model.
Each noseprints have unique features like fingerprints, so we can use it for identification.
To extract noseprint from certain area in nose, the landmarks, specific points in nose, is essential.
Also, the landmarks are to find whether the dog in the picture is in proper pose or not.
As a first step of the project, let's build a Deep Learning model for landmark detection with ResNet or Hourglass architecture, a powerful algorithm with simple structure.

The project contains:

Training for landmark detection in dog nose using ResNet or Hourglass arhcitecture
Inference with the trained model.

Training

Before training, you need to prepare a dataset in .npy file.
The .npy file should be dictionary type with two keys; 'img', which have values in form of (number of data, 224, 224, 3) and 'landmark', which have values in form of (number of data, 2 * number of landmark).
This is an example.

{
  'img': 
    [
      array([[[159, 196,  74],
        [159, 196,  74],
        [159, 196,  74],
        ...,
        [159, 196,  74],
        [159, 196,  74],
        [159, 196,  74]],

       [[159, 196,  74],
        [159, 196,  74],
        [159, 196,  74],
        ...,
        [159, 196,  74],
        [159, 196,  74],
        [159, 196,  74]],

       [[159, 196,  72],
        [159, 196,  72],
        [159, 196,  72],
        ...,
        [159, 196,  72],
        [160, 195,  74],
        [160, 195,  74]],

       ...,

       [[160, 195,  74],
        [160, 195,  74],
        [160, 196,  72],
        ...,
        [159, 196,  72],
        [159, 196,  72],
        [159, 196,  72]],

       [[162, 195,  74],
        [162, 195,  74],
        [160, 196,  72],
        ...,
        [159, 196,  72],
        [159, 196,  72],
        [159, 196,  72]],

       [[162, 195,  74],
        [162, 195,  74],
        [160, 196,  72],
        ...,
        [159, 196,  72],
        [159, 196,  72],
        [159, 196,  72]]], dtype=uint8), 
      array([[[ 48, 103,  54],
        [ 48, 103,  54],
        [ 52, 101,  57],
        ...,
        [ 52, 101,  57],
        [ 52, 101,  57],
        [ 52, 101,  57]],

       [[ 48, 103,  54],
        [ 50, 102,  54],
        [ 52, 101,  57],
        ...,
        [ 52, 101,  57],
        [ 52, 101,  57],
        [ 52, 101,  57]],

       [[ 50, 102,  54],
        [ 50, 102,  54],
        [ 52, 101,  57],
        ...,
        [ 52, 101,  57],
        [ 52, 101,  57],
        [ 52, 101,  57]],

       ...,

       [[ 52, 101,  57],
        [ 52, 101,  57],
        [ 52, 101,  57],
        ...,
        [ 52, 102,  54],
        [ 52,  99,  60],
        [ 52,  99,  61]],

       [[ 52, 101,  57],
        [ 52, 101,  57],
        [ 52, 101,  57],
        ...,
        [ 52, 102,  54],
        [ 52,  99,  60],
        [ 52,  99,  61]],

       [[ 52, 101,  57],
        [ 52, 101,  57],
        [ 52, 101,  57],
        ...,
        [ 52, 102,  54],
        [ 52,  99,  60],
        [ 52,  99,  61]]], dtype=uint8)
    ], 
  'landmark': 
    [
      array([119, 100, 138, 148, 109, 117, 121, 127, 118, 137, 96, 138, 143, 106, 138, 121, 148, 125, 154, 116], dtype=object), 
      array([118, 101, 138, 149, 108, 116, 123, 127, 119, 139, 95, 136, 142, 108, 134, 117, 142, 125, 152, 115], dtype=object)
    ]
}

After your dataset is ready, load the dataset and decide the architecture of the model.
There are three types of architecture; resnet50, resnet101 and hourglass.
While training, the weights of keras model will be saved in models directory.
The training will terminates after executing all epochs you set, or it ends if the validation loss doesn't get smaller after 20 epochs.

from noseprint import Train

noseprint = Train(dataset='dataset path', architecture='resnet101')
noseprint.train(epochs=100, batch_size=8, model_dir='./models')

Inferencing

After you finish the training, you can check the performance of your model.
The funcion "get_landmarks()" will return an array of landmarks.

from noseprint import Inference

noseprint = Inference(weights='weights path', architecture='resnet101', num_landmarks=10)
landmarks = noseprint.get_landmarks(image='image path')

[[109.880104  79.38798 ]
 [105.11736  126.143135]
 [148.36232   45.564014]
 [100.81828  191.37447 ]
 [ 38.335293 103.12261 ]
 [ 98.82869   46.07913 ]
 [113.325516  85.44805 ]
 [106.412926 184.7699  ]
 [ 89.97804  100.04876 ]
 [130.65442  127.0089  ]]

The funcion "show_landmarks()" will return the image with marks on each landmarks.

import cv2
from noseprint import Inference

noseprint = Inference(weights='weights path', architecture='resnet101', num_landmarks=10)
image = noseprint.show_landmarks(image='image path')
cv2.imshow(image)

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.idea		.idea
image		image
README.md		README.md
hourglass.py		hourglass.py
noseprint.py		noseprint.py
resnet.py		resnet.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Landmark detection by ResNet / Hourglass Model

The project contains:

Training

Inferencing

About

Releases

Packages

Languages

bbabi0901/Noseprint

Folders and files

Latest commit

History

Repository files navigation

Landmark detection by ResNet / Hourglass Model

The project contains:

Training

Inferencing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages