1

I couldn't understand the wording here.

Training for the comparisons

What does "shuffle the comparisons into one dataset" mean?

How does the method they use don't have $K \choose 2$ forward passes for K completions? Do they update $K \choose 2$ in an epoch for K completions or what?

nbro
  • 42,615
  • 12
  • 119
  • 217

0 Answers0