
# Neural Style Transfer

To run this code, download `imagenet-vgg-verydeep-19.mat` from this link: "https://www.kaggle.com/teksab/imagenetvggverydeep19mat?select=imagenet-vgg-verydeep-19.mat" and extract the zip file into the `pretrained-model` folder.
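Once the file is in place, it can be loaded with `scipy.io.loadmat`. A minimal sketch, assuming the file sits in `pretrained-model/` (the path and the `layers` indexing reflect the usual structure of this `.mat` file, not necessarily this repo's exact code):

```python
import scipy.io

# Hypothetical path; adjust to wherever the zip was extracted.
model = scipy.io.loadmat("pretrained-model/imagenet-vgg-verydeep-19.mat")

# The VGG-19 .mat file typically stores its layers under the "layers" key;
# each entry carries the layer name plus its weights and biases.
layers = model["layers"][0]
print("number of layers:", len(layers))
```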

## Architecture

*(architecture diagram)*

## Loss function

### 1. Content loss

*(content loss equation)*
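For reference, this is the standard content loss in the Gatys et al. formulation commonly used with this VGG-19 model (the exact normalization constant here may differ):

$$J_{\text{content}}(C, G) = \frac{1}{4\, n_H n_W n_C} \sum \left(a^{(C)} - a^{(G)}\right)^2$$

where $a^{(C)}$ and $a^{(G)}$ are the activations of a chosen hidden layer for the content image $C$ and the generated image $G$.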

### 2. Style loss

*(style loss equation)*
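In the standard formulation, the per-layer style loss is (again, constants may differ from the original image):

$$J_{\text{style}}^{[l]}(S, G) = \frac{1}{4\, n_C^2 \,(n_H n_W)^2} \sum_{i=1}^{n_C} \sum_{j=1}^{n_C} \left(G^{(S)}_{ij} - G^{(G)}_{ij}\right)^2$$

where $G^{(S)}$ and $G^{(G)}$ are the Gram matrices of the style image $S$ and the generated image $G$ at layer $l$.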

where `gram_matrix` is computed as

*(Gram matrix equation)*
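In the standard formulation this is:

$$G_{ij} = \sum_{k} A_{ik} A_{jk}, \qquad \text{i.e.}\; G = A A^{\top}$$

where $A$ is the $(n_C \times n_H n_W)$ matrix whose rows are the unrolled feature maps of the layer.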

### 3. Total loss

*(total loss equation)*
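In the standard formulation:

$$J(G) = \alpha\, J_{\text{content}}(C, G) + \beta\, J_{\text{style}}(S, G)$$

with $\alpha$ and $\beta$ weighting how strongly content and style are preserved.

Putting the three pieces together, here is a minimal numpy sketch of these losses. It follows the normalizations above, so the constants, function names, and the `alpha`/`beta` defaults are illustrative assumptions rather than this repo's exact code:

```python
import numpy as np

def content_loss(a_C, a_G):
    # a_C, a_G: (n_C, n_H * n_W) unrolled activations of one hidden layer
    n_C, n_HW = a_C.shape
    return np.sum((a_C - a_G) ** 2) / (4 * n_C * n_HW)

def gram_matrix(A):
    # Pairwise correlations between the n_C feature maps: G = A @ A.T
    return A @ A.T

def layer_style_loss(a_S, a_G):
    n_C, n_HW = a_S.shape
    G_S, G_G = gram_matrix(a_S), gram_matrix(a_G)
    return np.sum((G_S - G_G) ** 2) / (4 * n_C ** 2 * n_HW ** 2)

def total_loss(J_content, J_style, alpha=10, beta=40):
    # alpha/beta defaults are illustrative; tune them per image pair
    return alpha * J_content + beta * J_style
```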

## Why the Gram matrix is used in the style loss

We now know how to compute the style loss, but why is it computed using the Gram matrix? The Gram matrix essentially captures the distribution of features across the feature maps in a given layer. By minimising the style loss between two images, you are essentially matching the distribution of features between those images.

So let me take a shot at explaining this a bit more intuitively. Say you have the following feature maps. For simplicity, assume only three feature maps, two of which are completely inactive. In the first feature map set, the active map looks like a dog; in the second set, it looks like a dog upside down. If you manually compute the content and style losses, you get the values shown below: the style loss is zero, so no style information is lost between the two feature map sets, yet the content loss is large because the content is quite different.

*(worked example: content and style losses for a feature map and its vertical flip)*
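The same experiment is easy to reproduce in numpy. A toy sketch, where the `dog` array is just an arbitrary asymmetric pattern standing in for a feature map that fires on a dog:

```python
import numpy as np

# Three feature maps of size 4x4; only the first one is active.
dog = np.array([[0., 1., 2., 0.],
                [1., 3., 3., 1.],
                [0., 2., 2., 0.],
                [0., 1., 0., 0.]])
zeros = np.zeros_like(dog)

set_a = np.stack([dog, zeros, zeros])        # first set: the dog
set_b = np.stack([dog[::-1], zeros, zeros])  # second set: the dog upside down

def gram(feature_maps):
    # Unroll each feature map into a row, then G = A @ A.T
    A = feature_maps.reshape(feature_maps.shape[0], -1)
    return A @ A.T

content = np.sum((set_a - set_b) ** 2)            # > 0: the content moved
style = np.sum((gram(set_a) - gram(set_b)) ** 2)  # 0: feature statistics match

print(content, style)
```

Flipping the map changes *where* features are, so the content loss is nonzero, but it does not change how strongly features co-occur, so the Gram matrices (and hence the style loss) are identical.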

The result will look like this:

*(content image)* + *(style image)* = *(generated image)*

content_image + style_image = generated image