5

I know that a camera does not directly measure the immediate value of the electromagnetic field's vector, but rather the average value of a derived quantity (a statistic of the EM field). Perhaps this quantity is the light's Intensity $\left[W/m^2\right]$?

By the way, does the camera measure the intensity of the electric field alone, or the intensity of the magnetic field alone, or the sum of both the electric and the magnetic component?

Assume that there is a monochromatic camera and a single monochromatic light source. The camera takes a picture in the RAW format so there is no non-linear post-processing of the captured image and each pixel represents a single 14-bit number. Let's focus on a single pixel $p$ with value $p = 478$. I would like to understand where does the number 478 come from. Surely, the pixel $p$ must have the value $478$ because several light rays (EM waves) hit it.

My guess is that the value $478$ is a normalized (in the range $[0; 2^{14}-1]$) measurement of the EM field's Intensity in the space occupied by the pixel (so the units are Watts per unit area). The lower and upper bounds of the pixel value ($[0; 2^{14}-1]$) correspond to specific lower and upper bounds of absolute Intensity $\left[W/m^2\right]$. The two absolute values of Intensity then perhaps depend on the camera's parameters (like ISO and exposure time).

How can I mathematically describe the relation between the underlying EM field and the final pixel value? In other words: how should I transform the EM field to get the value 478? I want to be able to compute the pixel value that a camera would output for a specific pixel given the knowledge of the underlying EM field and given the camera settings (ISO, exposure, etc.).

Imagine a light ray (i.e. a light wave traveling directly to the pixel $p$), which is an EM wave. Let's describe just the electric field's value along a ray (a vector per each point on the ray, consisting of nonzero X and Y components orthogonal to direction of the propagation of the wave): $$ E_i(t, x) = E_{0i} * \cos\left( \omega t - kx + \delta_i \right) $$ where:

  • $t$ is the time,
  • $x$ is the distance (position) along the ray,
  • $i$ is either the X or the Y component of the electric field wave's vector (both orthogonal to the wave's propagation),
  • $E_{0i}$ is the amplitude of the $i$ component of the electric field $\left[N/C\right]$,
  • $\omega t - kx$ is the time-space propagator of the wave,
  • $\delta_i$ is the phase offset of the $i$ component of the electric field.

I would like to know how to define a hypothetical transformation $T(E(t, x)) = 478$ explained in terms of waves of the EM field, not in terms of photons (assuming that is even possible).

P.S. If there are any books, papers or blogposts that explain this topic in more detail, please share a link. Thank you.

jordi
  • 61

2 Answers2

8

Disclaimers

  1. It is not possible to give a quantitative explanation of the relationship between incident electromagnetic energy and the digital ADU readings of a camera pixel without a discussion of the quantization of the energy in the electromagnetic field.
  2. My semi-conductor physics is a bit poor so that part of the answer may be lacking. Perhaps other answers or comments can help clarify anything I get wrong or explain poorly.

Photodetectors

I'm going to talk about charge coupled device (CCD) sensors.

A single pixel on a CCD sensor includes a semiconducting device which produces charges when it is driven by electric fields from electromagnetic radiation. The device includes a photodetection region and an electron storage region. My vague understanding is that the photodetection region is a depletion region of some semiconducting interface. When electromagnetic fields impinge upon this region, the oscillating electric fields excite an electron/hole pair from their bound valence bands into their respective conducting bands. This photodetection region is in a DC bias field produced by a gate electrode. When the electron/hold pairs occupy the mobile conduction bands, and in the presence of the gating electric field, the electrons are swept into a different region of the device used for charge storage. This region is essentially a capacitor which is used to store charge. The electrons which are excited and stored in this process are often called photoelectrons.

Note that there are two electric fields. One is the electric field of the incident electromagnetic radiation. This is what is being detected. The second is the static electric field internal to the semiconducting device due to the gating. These two electric fields should not be confused. Note that this already answers a part of the question: It is the oscillating electric field from the incident electromagnetic radiation that is being detected by a camera sensor. The incident oscillating magnetic field is unrelated.

Quantitative Photodetection - How Many Photoelectrons are There?

We now know that the electric field from the incident electromagnetic field excites electrons into the conduction band of the semiconductor and that this is the first step in photodetection. But what is the quantitative relationship between the incident electromagnetic field amplitude/energy and the number of excited photoelectrons?

In short, the answer is that energy in the electromagnetic field of frequency $f$ is quantized in units of $hf$. For a unity quantum efficiency photodetector the number of photoelectrons is equal to

$$ N = IAT/hf $$

where

  • $N$ is the number of photoelectrons,
  • $I$ is the incident optical intensity (in units of $\text{J}/\text{m}^2/\text{s}$, for example),
  • $A$ is the detector area (in units of $\text{m}^2$, for example),
  • $T$ is the exposure time (in units of $\text{s}$, for example),
  • $h$ is Planck's constant, and
  • $f$ is the optical frequency of the incident light.

The number of photoelectrons is exactly equal to the number of quanta absorbed from the electromagnetic field. I think it is helpful to see the answer up front, however, below, I will give a more thorough discussion of the photoelectric effect.

The Photoelectric Effect

The OP requested "I would like to know how to define a hypothetical transformation $T(E(t,x))=478$ explained in terms of waves of the EM field, not in terms of photons (assuming that is even possible).". As seen above, it is not possible to give the complete story without making reference to the quantization of the energy fo the electromagnetic field. Because of this we will go into some detail about the photoelectric effect, so consider researching that further if necessary.

Below we will discuss the photoelectric effect as it pertains to the ejection of electrons from a metal. The basic ideas will carry over to the excitation of bound electrons from their valence bands into the conduction bands in the semiconducting photodetector.

For an electron to be excited, it is necessary for the electron to absorb energy from the electromagnetic field. For monochromatic light (of sufficiently high frequency), it is the case that the number of electrons excited (number) is proportional to the integrated optical energy over the detector active area and the exposure time. The integrated optical energy is the product

$$ E = IAT $$

where $I$ is the optical intensity, $A$ is the detector area, and $T$ is the optical exposure time.

However, surprisingly from the point of view of classical physics, if you vary the frequency of the incident electromagnetic radiation, you find that the number of photoelectrons generated has a non-trivial dependence on the frequency.

Consider illuminating a metal surface with electromagnetic radiation of a given frequency $f$ (specified in THz) and with a given integrated energy $E$ (specified in $\text{aJ}$, attojoules). Below I show a crude classical prediction for the number of ejected electrons and the quantum prediction for the number of ejected electrons.

The Photoelectric Effect

Classically, we may have expected the plot on the left. We may have expected that as we increase the integrated energy that the number of photoelectrons increases linearly and that the number of photoelectrons is independent from the incident optical frequency. However, in experiment, we see the plot on the right. We see that below a certain cutoff energy (10.4 THz in this case, stolen from the Wikipedia page example for Zn) no photoelectrons are ejected. Above the cutoff frequency we see that increasing the integrated energy does increase the number of photoelectrons, however, the conversion factor between integrated energy and number of photoelectrons varies with wavelength.

Indeed, the interpretation here, provided by Einstein in 1905, is that the energy of the electromagnetic field is quantized in units of $hf$ where $h$ is Planck's constant and $f$ is the optical frequency. The idea is that an electron can absorb at minimum one quanta $hf$ of energy from the electromagnetic field. In this case if the integrated energy is $E$ then the number of photoelectrons is given by

$$ N = E/hf = IAT/hf $$

Note that the cutoff frequency $f_{\text{cutoff}}$ is related to a minimum energy $\phi = hf_{\text{cutoff}}$ necessary to eject an electron from the metal. If an electron absorbs $hf$ energy then the excess energy $hf - hf_{\text{cutoff}}$ shows up as kinetic energy of the electron.

I claim that it is highly unlikely for $n$ quanta of energy $hf$ from the electromagnetic field to excite $m$ electrons from the material unless $n=m=1$. A convincing discussion of this is beyond the scope of this answer. For purposes of this answer we will assume that one electron is always excited as a result of absorbing $hf$ energy from the electromagnetic field, noting that some of the energy goes to unbinding the electron and the leftover energy appears as kinetic energy.

The conclusions from this sections are

  • The energy of the electromagnetic field is quantized into units $hf$
  • Exactly one electron is always excited by exactly one quanta $hf$ of energy from the electromagnetic field
  • If the incident intensity is $I$, the photodetector has area $A$, and the exposure time is $T$, then the integrated optical energy on the photodetector is $E=IAT$
  • The number of excited electrons is the integrated energy $E=IAT$ divided by the energy quanta $hf$: $N = IAT/hf$.

Finite Quantum Efficiency

Note that in practice it is not the case every quanta $hf$ from the integrated optical energy $E$ will cause an excitation of a photoelectron. There may be a variety of mechanisms by which optical energy does NOT result in photoelectron generation.

  • The light may fall on a region of the detector which does is not optically active
  • The light may be absorbed by some other mechanism than direct photoelectron generation (for example generating thermal excitations)
  • The light may reflect from the optical surface
  • etc.

These effects are captured by the photodetector quantum efficiency

$$ 0 < \epsilon_Q < 1 $$

Taking into account sub-unity quantum efficiency, the number of excited photoelectrons is calculated by

$$ N = \epsilon_Q I A T / hf $$

Counting Photoelectrons

The rest of the conversion from optical fields to digital counts is essentially classical electronics.

As discussed previously, the photoelectrons where excited into the semi-conductor conduction band and swept by the bias field into a storage region. Each pixel can hold a maximum number of photoelectrons before the photoelectrons start "falling out" of the storage region. I think of this as a bucket overflowing. The number of photoelectrons that can be stored before the bucket starts overflowing is related to the well-depth of the pixel, or the integrated energies at which we start to see effects like pixel saturation, bleaching, or bleeding. We will assume we are operating well below this point in the linear operating regime of the photodetector.

In a CCD sensor the "buckets" of pixels are shuttled around the sensor using various gating sequences to carry the buckets of electrons around the sensor. Eventually this "dance" delivers the electrons to a charge to voltage converter. One simple way to model this is that the charges are collected onto a capacitor. The capacitor then produces a voltage $V = Q/C = Ne/C$ where $N$ is the number of photoelectrons, $e$ is the electron charge, $Q=Ne$ is the total charge, and $C$ is the capacitance.

This voltage can then be read out on an analog-to-digital converter (ADC) which takes an analog voltage $V$ and converts it to a binary representation of an integer value. If the ADC has a maximum voltage $V_0$ and $d$ bits then the ADC value reading for $V$ will be

$$ D = \text{floor}\left(V/V_0 \times 2^d\right) $$

I might have an off-by-one error in this formula.

Note that, in practice, between the charge storage and the ADC, there will likely be a charge and/or voltage amplifier to boost the otherwise tiny electrical signals. We will signify the combined gain of these stages by $G$ so that the voltage $V$ input at the ADC is given by

$$ V = GNe/C $$

Putting it all together

Starting to put it all together. We can now start putting formulas together to get from properties of the incident electromagnetic field to the digital counts recorded by a camera pixel.

\begin{align*} D =& \text{floor}\left(V/V_0\times2^d\right)\\ =& \text{floor}\left(GNe/CV_0\times 2^d\right)\\ =& \text{floor}\left(\frac{\epsilon_Q IAT}{hf}\frac{Ge}{CV_0}2^d\right) \end{align*}

This is essentially the final answer except we can do one more step of work relating the incident intensity $I$ to the electric field amplitude $\mathcal{E}$. For linearly polarized light, the optical intensity $I$ is related to the electric field amplitude $\mathcal{E}$ by $$ I = \frac{\epsilon_0 c}{2} |\mathcal{E}|^2 $$ where $c$ is the speed of light and $\epsilon_0$ is the permittivity of free space. For circular or elliptically polarized light you may need to take some care thinking about how exactly you define the electric field amplitude to get the pre-factor on this formula right.

We can finally put everything together to get the final result

$$ D = \text{floor}\left(\frac{\epsilon_0c}{2}|\mathcal{E}|^2\frac{\epsilon_Q AT}{hf}\frac{Ge}{CV_0}2^d\right) $$ where

  • $D$ is the digital value output on the camera pixel,
  • $\epsilon_0$ is the permittivity of free space,
  • $c$ is the speed of light,
  • $\mathcal{E}$ is the electric field intensity,
  • $\epsilon_Q$ is the detector quantum efficiency,
  • $A$ is the pixel area,
  • $T$ is the exposure time,
  • $h$ is Planck's constant,
  • $f$ is the optical frequency of the incident light,
  • $G$ is the combined charge and voltage gain in the CCD sensor,
  • $e$ is the electron charge,
  • $C$ is the charge-to-voltage converter capacitance,
  • $V_0$ is the ADC reference voltage and
  • $d$ is the bit-resolution of the ADC.

Some further notes/disclaimers

Note that this is in some ways a very crude model of the CCD sensor detection chain. Many of the details have probably been botched with respect to how actual CCD sensors work. However, I think this crude model does capture many of the important features and at least gives a gist for the physics/signal processing happening and also gives a satisfactory answer to this question.

Also note that, in more detailed theory, the relationship between the incident optical intensity and the number of photoelectrons is actually a statistical relationship involving Poissonian shot noise. That is, for the same incident optical field but with repeated exposures, you will get different numbers of photoelectrons and different number of digital counts. This is related to "fundamental" quantum randomness. However, if the intensity is high enough, the signal-to-noise ratio becomes very good and we can ignore the statistical uncertainty in the number of photoelectrons. Also, unless the application is highly technical, it is likely the case the electronic noise from the amplification stages dominates quantum shot noise.

Direct answers to questions in the OP

I know that a camera does not directly measure the immediate value of the electromagnetic field's vector, but rather the average value of a derived quantity (a statistic of the EM field). Perhaps this quantity is the light's Intensity $[W/m^2]?$

The value recorded for a pixel is related to the intensity of the electromagnetic radiation incident on that pixel during the exposure time. The recorded value is proportional to the intensity on the pixel $I$, the pixel area $A$, and the exposure time $T$.

By the way, does the camera measure the intensity of the electric field alone, or the intensity of the magnetic field alone, or the sum of both the electric and the magnetic component?

The photoelectrons are technically excited by the electric field from the incident electromagnetic radiation. For this type of photoexcitation, the magnetic field is irrelevant. It is worth noting, as an aside, that the energy in electromagnetic radiation in free space is evenly split between electric and magnetic energy, so if you know any of the electric energy, the magnetic energy, or the electromagnetic energy you can figure out the other two.

How can I mathematically describe the relation between the underlying EM field and the final pixel value?

...

I would like to know how to define a hypothetical transformation $T(E(t,x))=478$ explained in terms of waves of the EM field, not in terms of photons (assuming that is even possible).

See the formula above as the main part of this answer. But note that we couldn't avoid a discussion of the quantization of the electromagnetic field energy. I gave the entire answer above without using the word "photon" because that word can be problematic. My definition of a "photon" is "a quantized excitation (of energy $hf$) of the electromagnetic field". In the description above you can easily see where I could have instead used the word "photon". For example, we can summarize the relationship between the number of photoelectrons and the incident electromagnetic radiation by saying "The number of photoelectrons is equal to the number of photons collected on the detector with area $A$ during exposure time $T$ multiplied by the detector quantum efficiency $\epsilon$". I will note that my definition of photon is still in terms of the electromagnetic field. This is contrasted with other definitions which involve the idea of a particle. I think my definition is better suited to answer what was requested in this question.

P.S. If there are any books, papers or blogposts that explain this topic in more detail, please share a link. Thank you.

The Handbook of Biological Confocal Microscopy, Third Edition edited by James Prawley has an excellent appendix titled "More Than You Ever Really Wanted To Know About Charge-Coupled Devices" which explains much of what is described in this answer in better detail. You may be able to find it online. I highly recommend this as a resource. There is a lot that I wanted to know about CCDs but, true to the title of this appendix, it has more info than even I wanted to know!

Jagerber48
  • 16,234
1

Consider a wave of light entering the lens of a camera. The lens refracts and reflects the light based on the wavelength (and incident angle) of the light. The responses are mediated by the index of refraction of the medium the carries the light (air) and the lens itself. The lens may absorb a portion of the light depending on whether the wavelength is at a frequency that resonates with specific modes (electronic, ionic, and molecular) in the material. The absorption is controlled by absorption coefficient of the lens material. Finally, the lens may accept only a specific polarization vector of the incident light. Finally, whatever amount of light that is transmitted through the lens is refracted as it leaves the lens.

You control the wavelengths and polarizations of light that get through the camera to reach its detector by controlling the quality of the lens. You control the spot size of the light (among other things) at the detector by setting the aperture. You control the integrated total intensity of light (the total number of photons) that reaches the detector by controlling the exposure time.

Camera detectors detect light because a photo-excitation event happens inside of them. The excitation happens due to the photon of light having at least if not exactly the required energy (frequency). Camera detectors count the number photo-excitation events that occur. The detectors count the number of photons per unit time on the pixel that are incident with the required energy to cause the photo-excitation event. The transfer map is to set pixel intensity as being proportional to number of incident photons. The number of incident photons is mapped as the amplitude squared of the light wave equation.

The physical pixel has an efficiency factor to transform one incident photon into one measured event. The factor is associated with an excitation cross-section (units for cross section are typically as area).

In summary, to obtain a transfer matrix to map light waves impinging on a pixel in a detector to the pixel output value, select the frequency required by the physical pixel to initiate a photo-excitation event inside of it, use the amplitude squared at that frequency to establish the intensity (number of photons per time) impacting on the pixel, and apply the requisite pixel efficiency factor (cross section) to map light intensity to pixel output value.