A t-design $X$ is defined by the following equation
$$\frac{1}{|X|} \sum_{U \in X} U^{\otimes t} \otimes\left(U^*\right)^{\otimes t}=\int_{U(d)} U^{\otimes t} \otimes\left(U^*\right)^{\otimes t} d U$$
where the right hand side uses the Haar measure for $dU$. Why does this definition make sense compared to simply replacing $U^{\otimes t}$ with $U^t$. I am trying to understand why we invoke the action of $U$ on $t$ copies of the space instead of the action of $U^t$ on one copy of the space. What is the motivation behind that?