Lets say we have a trajectory (positions and momenta) $(x(t), p(t))$ that is the solution of the equation of motion for a system with Hamiltonian $H(x,p)$. For some function $A(x,p)$, the time average is defined as $$\langle A \rangle_{\text{time}} = \lim_{T\to\infty} \frac{1}{T} \int_{0}^{T} \mathrm{d}t A(x(t),p(t)).$$ This assigns a probability $P(M)$ to a set $M \subset \{(x(t),p(t))|t \in [0,\infty)\}$ (via the characteristic function of $M$) that increases with increasing time that the trajectory $(x(t),p(t))$ spends in $M$. I wonder if this induces a probability distribution on the phase space $\{(x,p) | H(x,p) = E\}$ if the trajectory $\{(x(t),p(t))|t \in [0,\infty)\}$ is a dense subset of the phase space. Specifically, does this induce the microcanonical probability density function? I feel it should, given that we know the velocity $v$ the system has at every point of phase space: $v =|\nabla H(x,p)|$.
A common expression for the microcanonical ensemble average is (see, e.g., Schwabl "Statistical Mechanics"): $$\langle A \rangle_{\text{ensemble}} \propto \int_{S}\mathrm{d}S \frac{A(x,p)}{|\nabla H(x,p)|}$$ where $S = \{(x,p)|H(x,p) = E\}$, so every point in the phase space is weighted by the inverse of the velocity of the system at that point, which makes sense. However, this expression is not derived from the time average in the book by Schwabl, and I can't find much more in other books or lecture notes. Any help is appreciated.
Edit: I rephrase my question. Is there a textbook or something like that (lecture notes etc.) that argues in the following way: We want to have $$ \langle A \rangle_{time} = \langle A \rangle_{ensemble} $$ because the left represents what we do in experiments. For this to hold we need to have: $$\langle A \rangle_{\text{ensemble}} \propto \int_{S}\mathrm{d}S \frac{A(x,p)}{|\nabla H(x,p)|}$$ and then continues to give examples for simple systems where this holds and were it doesnt, and maybe even derives under which requirements it holds for complex systems (or is this just impossible). This just seems to me as the more direct way to introduce the microcanical density, especially when considering the justification for the $\delta$ function distribution, e.g., in Schwabl: "it is intuitively plausible that in equilibrium no particular region should play a special role".