Original paper: https://arxiv.org/abs/1508.06576

A Gram matrix is the product of a matrix and its transpose, G = F Fᵀ. If the rows of F are vectors, entry G[i][j] is the inner product of vector i and vector j.

It is used to measure similarity between vectors. In neural style transfer (NST), the similarity between the feature channels of a layer is calculated with the Gram matrix.
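A minimal NumPy sketch of this computation, assuming a single feature map laid out as (channels, height, width). The function name `gram_matrix` and the toy values are illustrative and do not come from the paper. Normalization by the number of elements, which many implementations apply, is left out here.

```python
import numpy as np

def gram_matrix(features: np.ndarray) -> np.ndarray:
    """Gram matrix of a feature map shaped (channels, height, width)."""
    channels = features.shape[0]
    # Flatten each channel into one row vector: shape (channels, height * width).
    flat = features.reshape(channels, -1)
    # Entry (i, j) is the inner product of channel i and channel j.
    return flat @ flat.T

# Toy feature map: 2 channels of size 2x2 (values made up for illustration).
features = np.array([
    [[1.0, 0.0], [0.0, 1.0]],
    [[2.0, 0.0], [0.0, 2.0]],
])
gram = gram_matrix(features)
print(gram)
```

The result has one row and one column per channel and is symmetric. In this toy input the second channel is twice the first, so the off-diagonal entries are large relative to the diagonal, which reflects that the two channels are strongly similar.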