If you are machine learner and are working on something related to kernel methods, I am sure most of you are familiar with the so-called **kernel trick**, which is very fundamentally important for most kernel-based learning machines. The equation below gives a formal definition of the kernel trick:

That is, the inner product between the feature map and can be written in term of some positive semidefinite function . This allows one to replace the inner product with the kernel evaluation, and thereby does not need to compute explicitly. Similar to the standard kernel trick, the generalized version can be written as

where is an operator in . Note that the generalized kernel trick reduces to the standard kernel trick when where is the identity operator. Kadri et al. (2012) showed that this trick holds for any implicit mapping of a Mercer kernel given for self-adjoint operator . This is trick particularly useful when deriving the learning algorithm for structured output learning.