Search code examples
signal-processingsamplingpcm

How does image digitalization differ from sound digitalization (PCM)?


I am trying to understand digitalization of sound and images. As far as I know, they both need to convert analog signal to digital signal. Both should be using sampling and quantization.

  • Sound: We have amplitudes on axis y and time on axis x. What is on axis x and y during image digitalization?

  • What is kind of standard of sample rate for image digitalization? It is used 44kHz for CDs (sound digitalization). How exactly is used sample rate for images?

  • Quantization: Sound - we use bit-depth - which means levels of amplitude - Image: using bit-depth also, but it means how many intesities are we able to recognize? (is it true?)

  • What are other differences between sound and image digitalization?


Solution

  • Acquisition of images can be summarized as a spatial sampling and conversion/quantization steps. The spatial sampling on (x,y) is due to the pixel size. The data (on the third axis, z) is the number of electrons generated by photoelectric effect on the chip. These electrons are converted to ADU (analog digital unit) and then to bits. What is quantized is the light intensity in level of greys, for example data on 8 bits would give 2^8 = 256 levels of gray.

    An image loses information both due to the spatial sampling (resolution) and the intensity quantization (levels of gray).

    Unless you are talking about videos, images won't have sampling in units of Hz (1/time) but in 1/distance. What is important is to verify the Shannon-Nyquist theorem to avoid aliasing. The spatial frequencies you are able to get depend directly on the optical design. The pixel size must be chosen respectively to this design to avoid aliasing.

    EDIT: On the example below I plotted a sine function (white/black stripes). On the left part the signal is correctly sampled, on the right it is undersampled by a factor of 4. It is the same signal, but due to bigger pixels (smaller sampling) you get aliasing of your data. Here the stripes are horizontal, but you also have the same effect for vertical ones.

    enter image description here