However, particle creation is not allowed in quantum mechanics.
what would be a brief outline of how QFT extends QM to allow for particle creation?
There is nothing inherent about QFT that allows for particle creation: instead the important point is that when QFT is mentioned, people are often implicitly referring to relatvistic QFT and it's the introduction of relativity that allows for particle creation.
In textbook applications, QM enjoys Galilean symmetries which imply that mass is a conserved quantity and this fact forbids particle creation.  In textbook applications, QFT instead enjoys the symmetries of special relativity, in which mass is no longer conserved and particle creation is kinematically possible.
This is not a general property of QFTs, though.  There are many examples of non-relativistic QFTs which again forbid particle creation due to the same symmetry constraints which forbid it in QM.
The above is my answer to your direct question.  I can't help but also write about how QFT extends QM in general, below.
General Relationship Between QFT and QM
QFT is a pretty straightforward generalization/extension of QM, despite the fact that it is not often presented this way in textbooks.
In QM, we are tracking degrees of freedom which depend on a single variable: time.  For instance, point particles can be described by their positions $\vec{q}(t)$, from which we can form Hamiltonians $H(\vec{q}, \vec{p})$ and create states $|\Psi\rangle$ which evolve under $H(\vec{q}, \vec{p})$ according to Schrodinger's equation. $|\Psi\rangle$ predicts the probability of finding the system with any particular value of $q$.
In QFT, things are nearly the same.  The only difference is that we are now tracking fields, $\phi(t, x)$ which are functions of both time and space. The rest proceeds as before: from $\phi(t,x)$ we can form Hamiltonians $H(\phi, \pi_\phi)$ and create states $|\Psi\rangle$ which evolve under $H(\phi, \pi_\phi)$ according to Schrodinger's equation. $|\Psi\rangle$ predicts the probability of finding the system with any particular value of $\phi$ for every point in space.
The main reason that QFT is not typically taught this way in textbooks is that the answers to questions of interest in QFT are usually more efficiently computed by other means, i.e. you don't need all the scaffolding of wavefunctions and Schrodinger's equation.  Action- and path-integral-based methods are typically more convenient.  For instance, using the wavefunction is a terrible way to compute the $S$-matrix which describes scattering events.
The one area I know of where so-called "Schrodinger-picture field theory" is used and the computation look roughly like their QM counterparts is cosmology where it falls under the (awesomely) named "wavefunction of the universe" method; see the Hartle-Hawking paper of the same name.
I think this viewpoint is briefly mentioned in Weinberg somewhere (as is almost all of QFT), but I couldn't find the section at the moment.  I know it is discussed fairly extensively in this book by Hatfield.