Let $\phi$ denote a field consisting of creation and annihilation operators. In physics, the Wick ordering of $\phi$, denoted $:\phi:$, is defined so that all creation are to the left of all annihilation operators. This is the definition given in many physics textbooks and also on the normal order Wikipedia page. The reason for introducing this ordering is to have a finite expectation value for the vacuum (e.g. the Casimir effect).
In mathematically rigorous treatments of quantum field theory Wick ordering is given as a probabilistic tool and is defined recursively. For example on p.9 in Simon's book on Euclidean quantum field theory he says:
Let $f$ be a random variable with finite moments. Then $:f^n:$, $n = 0, 1, \ldots$ is defined recursively by: $$:f^0: ~=~ 1 \tag{I.14a} $$ $$ \frac{\partial}{\partial f} :f^n: ~=~ n : f^{n-1}: \qquad n = 1, 2, \ldots \tag{I.14b} $$ $$ \langle :f^n: \rangle ~=~ 0 \qquad n = 1, 2, \ldots \tag{I.14c}$$
I have also seen this arXiv paper by Wurum and Berg titles "Wick Calculus" and this MathOverflow question. They give some brief motivation and give an analogy between Wick ordering and Hermite polynomials, but I am struggling to see how the probabilistic definition of Wick ordering given above has anything to do with the physicists idea of putting creating operators before annihilation operators. How does this version of Wick ordering help make things finite?