In Quantum Mechanics it is said that the Galileo transformation $$\hat{\mathbf{r}}\mapsto \hat{\mathbf{r}}-\mathbf{v}t\quad \text{and}\quad \hat{\mathbf{p}}\mapsto \hat{\mathbf{p}}-m\mathbf{v}\tag{1}$$ is given by the operator $$\hat{G}(\mathbf{v},t)=\exp\left[\dfrac{i}{\hbar}\mathbf{v}\cdot (m\hat{\mathbf{r}}-\hat{\mathbf{p}}t)\right].\tag{2}$$ Now I want to understand how does one show that this is the operator that implements the Galileo transformation.
I just can't understand, because for me, since we want $\hat{\mathbf{r}}\mapsto \hat{\mathbf{r}} -\mathbf{v}t$ it seems that the operator should just be a translation by $\mathbf{v}t$ which would be
$$\hat{G}(\mathbf{v},t)=\exp\left[\dfrac{i}{\hbar}\hat{\mathbf{p}}\cdot \mathbf{v}t\right]$$
but that's not it. There's also the $m\hat{\mathbf{r}}$ part which I don't understand where it comes from.
I've tried two things: first, defining $\tilde{\psi}(\mathbf{r},t)=\psi(\mathbf{r}+\mathbf{v}t,t)$ to be the transformed wavefunction. It also leads me to the translation only.
The second thing was to define
$$\hat{G}(\mathbf{v},t)=1+\dfrac{i}{\hbar}\hat{\varepsilon}(\mathbf{v},t)$$
with ${\varepsilon}$ infinitesimal and impose the conditions
$$\hat{G}(\mathbf{v},t)^\dagger \hat{\mathbf{R}}(t)\hat{G}(\mathbf{v},t)=\hat{\mathbf{R}}(t)-\mathbf{v}t$$ $$\hat{G}(\mathbf{v},t)^\dagger \hat{\mathbf{P}}(t)\hat{G}(\mathbf{v},t)=\hat{\mathbf{P}}(t)-m\mathbf{v}$$
in terms of the infinitesimal operator this becomes
$$\dfrac{i}{\hbar}[\hat{\mathbf{R}}(t),\hat{\varepsilon}(\mathbf{v},t)]=\mathbf{v}t, \quad\dfrac{i}{\hbar}[\hat{\mathbf{P}}(t),\hat{\varepsilon}(\mathbf{v},t)]=m\mathbf{v}$$
but this doesn't leads very far.
So what is the reasoning behind the $\hat{G}$ usually presented being the operator that implements Galileo transformations?