Original paper : https://arxiv.org/abs/1508.06576
Gram matrix is the multiplication of a vector and its transpose.
It is used to find similarity between vectors. In NST similarity between feature channels are calculated with Gram Matrix.