
The following are two types of transformations that are generally used in image processing:

  1. Affine transformation
  2. Projective transformation

The affine transformation is also a backbone operation in neural networks. It is expressed as

$$\mathbf{w}\mathbf{x}+\mathbf{b}$$ where $\mathbf{w}$, $\mathbf{x}$, and $\mathbf{b}$ are matrices. In image processing, $\mathbf{x}$ is generally treated as the image.

The projective transformation is also a type of transformation on images, and it may differ from the affine transformation. I want to know whether it can be represented as a mathematical expression.

If yes, what is the expression for projective transformation?

nbro
hanugm

1 Answer


I explain in this answer what a projective transformation (aka projectivity or homography) is. It's a function $h$ of the form $$h: \mathbb{P}^2 \rightarrow \mathbb{P}^2,$$ where $\mathbb{P}^2$ is the projective plane, whose points are represented by homogeneous 3-vectors, i.e. non-zero vectors in $\mathbb{R}^3$ that are considered the same point when they differ only by a non-zero scale factor.

You can also represent a homography as a $3 \times 3$ matrix $\mathbf{H}$, so that, when we apply this projective transformation to some input $\mathbf{x} \in \mathbb{P}^2$, we get $\mathbf{x}' \in \mathbb{P}^2$, so we can represent a projective transformation as follows.

$$\mathbf{H}\mathbf{x} = \mathbf{x}'$$
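To make the expression concrete, it can help to write the product out in components. The entry names $h_{ij}$ and the coordinates $(x, y)$ and $(x', y')$ below are notation assumed here for illustration. In homogeneous coordinates,

$$\begin{pmatrix} x_1' \\ x_2' \\ x_3' \end{pmatrix} = \begin{pmatrix} h_{11} & h_{12} & h_{13} \\ h_{21} & h_{22} & h_{23} \\ h_{31} & h_{32} & h_{33} \end{pmatrix} \begin{pmatrix} x_1 \\ x_2 \\ x_3 \end{pmatrix}$$

If the input is an ordinary image point $(x, y)$, written as $(x, y, 1)$, and $x_3' \neq 0$, then dividing by the third component gives the mapped point in ordinary coordinates:

$$x' = \frac{x_1'}{x_3'} = \frac{h_{11}x + h_{12}y + h_{13}}{h_{31}x + h_{32}y + h_{33}}, \qquad y' = \frac{x_2'}{x_3'} = \frac{h_{21}x + h_{22}y + h_{23}}{h_{31}x + h_{32}y + h_{33}}$$

The division by a term that depends on $(x, y)$ is what distinguishes this from an affine map, where the denominator is constant.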

So, basically, a projective transformation is a linear transformation of homogeneous coordinates, given by a non-singular matrix $\mathbf{H}$.
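As a rough illustration of $\mathbf{H}\mathbf{x} = \mathbf{x}'$, here is a minimal sketch in plain Python. The function name `apply_homography` and the example matrix are assumptions made up for this example, not part of any library.

```python
def apply_homography(H, point):
    """Map a 2D point (x, y) through a 3x3 matrix H (hypothetical helper)."""
    x, y = point
    # Lift the point to homogeneous coordinates (x, y, 1).
    xh = (x, y, 1.0)
    # Matrix-vector product H * xh, one row at a time.
    xp, yp, wp = (sum(H[i][j] * xh[j] for j in range(3)) for i in range(3))
    if wp == 0:
        # The image is a point at infinity, with no ordinary 2D coordinates.
        raise ValueError("point maps to a point at infinity")
    # Divide by the third component to return to ordinary coordinates.
    return (xp / wp, yp / wp)


# Example matrix chosen arbitrarily; the non-trivial last row makes it
# a genuinely projective (non-affine) map.
H = [
    [1.0, 0.0, 2.0],
    [0.0, 1.0, 3.0],
    [0.5, 0.0, 1.0],
]
mapped = apply_homography(H, (2.0, 1.0))
```

The matrix product itself is linear. The final division is needed only to read the result back as an image point.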

You can generalize these ideas to higher-dimensional projective spaces, i.e. $\mathbb{P}^n$.

Although you can represent a projective transformation as a matrix multiplication, there's more to it. The matrix $\mathbf{H}$ has 9 entries, but it is only defined up to a non-zero scale factor, so a homography has 8 degrees of freedom. You can also view a homography as a generalization of other transformations, like isometries, similarities, and affinities. This is explained in more detail in chapter 2 of the book Multiple View Geometry in Computer Vision (2nd edition) by Richard Hartley and Andrew Zisserman.
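The following sketch, again in plain Python, illustrates two of these points under stated assumptions: an affine map $\mathbf{A}\mathbf{x} + \mathbf{t}$ can be written as a homography whose last row is $(0, 0, 1)$, and multiplying $\mathbf{H}$ by a non-zero scalar does not change where points are mapped. The helper names and the numbers are hypothetical, chosen only for illustration.

```python
def apply_homography(H, point):
    """Map a 2D point through a 3x3 matrix H (hypothetical helper)."""
    x, y = point
    xh = (x, y, 1.0)  # homogeneous coordinates
    xp, yp, wp = (sum(H[i][j] * xh[j] for j in range(3)) for i in range(3))
    if wp == 0:
        raise ValueError("point maps to a point at infinity")
    return (xp / wp, yp / wp)


def affine_to_homography(A, t):
    """Embed the affine map x -> A x + t (A is 2x2, t has length 2) in a 3x3 matrix."""
    return [
        [A[0][0], A[0][1], t[0]],
        [A[1][0], A[1][1], t[1]],
        [0.0, 0.0, 1.0],  # this last row is what makes the map affine
    ]


A = [[2.0, 0.0], [0.0, 2.0]]  # uniform scaling by 2
t = (1.0, -1.0)               # translation
point = (3.0, 4.0)

# Direct affine computation A x + t.
direct = (
    A[0][0] * point[0] + A[0][1] * point[1] + t[0],
    A[1][0] * point[0] + A[1][1] * point[1] + t[1],
)

H = affine_to_homography(A, t)
via_homography = apply_homography(H, point)

# Scaling every entry of H by the same non-zero factor gives the same map.
H_scaled = [[5.0 * entry for entry in row] for row in H]
via_scaled = apply_homography(H_scaled, point)
```

Here `direct`, `via_homography`, and `via_scaled` all describe the same image point, which is one way to see why only 8 of the 9 entries of $\mathbf{H}$ are independent.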

nbro