How do transformers compare to CNNs in terms of compute budget (and computing time) during inference?

Asked Nov 25 '22 at 08:55

Active Dec 27 '22 at 18:10

Viewed 114 times

Transformers are data and GPU hungry during training. Is this also true at inference time? How do transformers compare to feedforward CNNs e.g., during bounding box generation at inference time? I haven't found a good comparison of computing time and computational resources.

edited Dec 27 '22 at 18:10

asked Nov 25 '22 at 08:55

Mariusmarten

How do transformers compare to CNNs in terms of compute budget (and computing time) during inference?

0 Answers0