The DOF problem in iris acquisition systems

This is the third post in the series on iris acquisition for biometrics. In the first and second posts we saw that, at least in theory, iris recognition is an ideal biometric, and we went through some of the desirable properties of an iris acquisition system. However, most current iris recognition systems require a single subject to stand (or move slowly) at a certain standoff distance from the camera in order to capture and process iris images. Wouldn’t it be nice if iris recognition could be performed simultaneously for a group of people who may be standing or moving within a large volume? Such systems could potentially be used in crowded places such as airports, stadiums, railway stations, etc.

In this post, we will look at one of the limitations of current iris recognition systems – the limited depth of field – the fundamental cause of this limitation, and how some current systems are addressing it.

The problem of DOF

The inability of any conventional imaging system to capture sharp images within a large volume is illustrated in Figure 1.

Figure 1 Depth of field (DOF) problem. Image of three human-figure cut-outs with sinusoidal patterns (2 lp/mm) and artificial irises, placed 11 cm apart from each other. The camera, with an 80 mm focal-length lens at f/5, was focused on the middle cut-out (3.6 meters from the camera). It is evident that the spatial resolution in the image falls off rapidly with increasing distance from the plane of sharp focus (the middle cut-out), preventing the camera from resolving fine details uniformly across the imaging volume.


Perfect imaging corresponds to the ability of an imager to produce a scaled replica of an object in the image space [1]. When only a small portion of the light wave emerging from an infinitesimally small point source is collected through the finite opening of a camera’s aperture (Figure 2(a)), the replica in the image space is not exact even in the absence of aberrations; instead, the image of the point spreads out in space due to diffraction at the aperture. This dispersed response in the three-dimensional image space is called the point spread function (PSF). The spread of the PSF along the transverse (xy) direction (the 2D PSF) restricts an imager’s ability to resolve fine details (spatial frequencies) in the image. For an extended object, which is made up of many points, the 2D PSF smears the responses from neighboring points into each other, causing blur. Similarly, the spread along the longitudinal (z) direction limits the ability to discriminate points staggered closely along the optical axis, creating a region of uncertainty. However, this very extension of the 3D PSF along the optical axis also enables multiple spatially separated objects (or points) within a volume in the object space to form acceptably sharp images at once. Conversely, a (point) object may be placed anywhere within this zone and still form a satisfactory image. This zone of tolerance in the object space is called the depth of field; the corresponding zone in the image space is called the depth of focus [2]. In this post, the acronym “DOF” is used for both depth of field and depth of focus wherever its meaning is apparent from the context.

In the image space, the DOF is defined as the region of the 3D PSF where the intensity is above 80% of the central maximum [3,4]. This zone has the shape of a prolate spheroid. In the absence of aberrations, the maximum intensity occurs at the geometric focal point, z_g, where the contributions from all parts of the pupil are in phase. Figure 2(b) shows the aberration-free intensity distribution, I_n(r, \delta z), as a function of defocus \delta z = z_i - z_g about the geometric focal point, for a light source placed 100 mm from a lens of 25 mm focal length and 5 mm aperture diameter. The expression for the distribution, normalized so that I_n(0,0) equals unity, is obtained using scalar diffraction theory and paraxial assumptions.

Figure 2 Incoherent impulse response and DOF. (a) The image A’ of a point source A spreads out in space, forming a zone of tolerance in the image space called the depth of focus (DOF); (b) the normalized focal intensity distribution of the 3D PSF of a 25 mm, f/5 lens imaging an axial point source at a distance of 100 mm. The expression for the 3D PSF was obtained for a circular aperture using scalar diffraction theory and paraxial assumptions. The DOF, which has a prolate-spheroidal shape, is defined as the region within which the intensity is above 80% of the intensity at the geometric focal point. The figure shows iso-surfaces at the 0.8, 0.2, 0.05 and 0.01 intensity levels. The ticks on the left vertical side indicate the locations of the first zeroes of the Airy pattern in the focal plane. The vertical axis has been exaggerated 10 times to improve the display of the distribution.


The shape—length and breadth—of the 80% intensity region (Figure 2(b)) dictates the quality of the image acquired by an imager in terms of lateral spatial resolution and DOF.
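To get a feel for these numbers, here is a minimal sketch (in Python) that evaluates the on-axis intensity of the aberration-free 3D PSF for the same 25 mm, f/5 lens and 100 mm source distance, and then measures the width of the 80% zone. It uses the standard scalar, paraxial result for a circular aperture, I(0, \delta z)/I(0,0) = [\sin(u/4)/(u/4)]^2, with u the normalized defocus parameter; the wavelength is not stated in the post, so a mid-visible value of 550 nm is assumed here purely for illustration.

```python
import numpy as np

# Parameters from the example above; the wavelength is NOT given in the post,
# so a mid-visible value is assumed here purely for illustration.
wavelength = 550e-9          # assumed wavelength [m]
f = 25e-3                    # focal length [m]
aperture_diameter = 5e-3     # f/5 lens -> 5 mm aperture [m]
z_o = 100e-3                 # distance of the point source from the lens [m]

# Paraxial image distance from the thin-lens equation: 1/z_i = 1/f - 1/z_o
z_i = 1.0 / (1.0 / f - 1.0 / z_o)
a = aperture_diameter / 2.0  # aperture radius

# Normalized on-axis intensity of the aberration-free 3D PSF (scalar, paraxial):
#   I(0, dz) / I(0, 0) = [sin(u/4) / (u/4)]^2,
#   u = (2*pi / wavelength) * (a / z_i)**2 * dz   (normalized defocus)
dz = np.linspace(-200e-6, 200e-6, 4001)   # defocus about the geometric focus [m]
u = (2 * np.pi / wavelength) * (a / z_i) ** 2 * dz
I_axial = np.sinc(u / (4 * np.pi)) ** 2   # np.sinc(x) = sin(pi*x)/(pi*x)

# Depth of focus: axial extent over which the intensity stays above 80% of the peak
in_dof = dz[I_axial >= 0.8]
print(f"Paraxial image distance z_i = {z_i * 1e3:.2f} mm")
print(f"80% (depth of focus) zone  ~ {(in_dof.max() - in_dof.min()) * 1e6:.0f} um")
```

With these assumed values, the 80% zone extends over only about a tenth of a millimeter along the axis, which conveys how thin the depth of focus of even a modest f/5 lens can be at close focusing distances.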



Ambiguity function (AF) and its use in OTF analysis

The 2D Ambiguity Function (AF) and its relation to the 1D Optical Transfer Function (OTF)

The Ambiguity Function (AF) is a useful tool for optical system analysis. This post is a basic introduction to the AF and how it can be used to analyze incoherent optical systems. We will see that the AF simultaneously contains all the OTFs associated with a rectangularly separable incoherent optical system under varying degrees of defocus [2-4]. Thus, by inspecting the AF of an optical system, one can easily predict the performance of the system in the presence of defocus. The AF has been used, for example, in the design of the extended-depth-of-field cubic phase mask system.


NOTE:

This post was created using an IPython notebook. The most recent version of the IPython notebook can be found here.

To understand the basic theory, we shall consider a one-dimensional pupil function, which is defined as:

(1) \hspace{40pt} P(x) = \begin{cases} 1 & \text{if } |x| \leq 1, \\ 0 & \text{if } |x| > 1. \end{cases}

The *generalized pupil function* associated with P(x) is the complex function \mathcal{P}(x) given by the expression [1]:

(2) \hspace{40pt}  \mathcal{P}(x) = P(x)e^{jkW(x)}

where W(x) is the aberration function. Then, the amplitude PSF of an aberrated optical system is the Fraunhofer diffraction pattern (the Fourier transform evaluated at the frequency f_x = x/(\lambda z_i)) of the generalized pupil function, and the intensity PSF is the squared magnitude of the amplitude PSF [1]. Note that z_i is the distance between the aperture/pupil and the plane where the diffraction pattern is observed.
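As a small numerical illustration of Eqs. (1) and (2), the sketch below builds a defocused one-dimensional generalized pupil and obtains the intensity PSF as the squared magnitude of its Fourier transform. The sampling grid, the half wave of defocus (W(x) = W_{20} x^2 with W_{20} = 0.5\lambda), and the use of normalized pupil coordinates are assumptions made here for illustration; they are not values from the references.

```python
import numpy as np

# Minimal sketch of Eqs. (1) and (2): intensity PSF of a defocused 1D system.
# The sampling grid, the amount of defocus and the normalized coordinates are
# assumptions made for this illustration, not values from the references.
N = 2048
x = np.linspace(-4, 4, N)                    # normalized pupil coordinate

P = (np.abs(x) <= 1).astype(float)           # Eq. (1): clear 1D pupil, |x| <= 1

wavelength = 1.0                             # work in units of one wavelength
k = 2 * np.pi / wavelength
W20 = 0.5                                    # assumed: half a wave of defocus
W = W20 * x**2                               # quadratic (defocus) aberration W(x)
gen_pupil = P * np.exp(1j * k * W)           # Eq. (2): generalized pupil function

def intensity_psf(pupil):
    """Amplitude PSF = Fourier transform of the (generalized) pupil, i.e. its
    Fraunhofer pattern; intensity PSF = squared magnitude of the amplitude PSF."""
    amp = np.fft.fftshift(np.fft.fft(np.fft.ifftshift(pupil)))
    return np.abs(amp) ** 2

psf_focused = intensity_psf(P)               # aberration-free reference
psf_defocus = intensity_psf(gen_pupil)       # PSF with half a wave of defocus

# Peak of the aberrated PSF relative to the unaberrated peak (Strehl-type ratio)
print(f"Peak ratio at {W20} waves of defocus: "
      f"{psf_defocus.max() / psf_focused.max():.3f}")

# The OTF is the Fourier transform of the intensity PSF (equivalently, the
# normalized autocorrelation of the generalized pupil); the MTF is its modulus.
otf = np.fft.fftshift(np.fft.fft(np.fft.ifftshift(psf_defocus)))
mtf = np.abs(otf) / np.abs(otf).max()
```

Working in normalized coordinates keeps the example independent of any particular wavelength or imaging geometry; only the relative shapes of the PSF and MTF matter here.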


Progression of pixel resolution in digital cameras

Thanks to the megapixel war, pixels in digital sensors have shrunk considerably over the years. Consequently, the pixel resolution (the number of pixels in a digital image) has improved. Currently, the pixel resolution (assuming a gray-scale sensor) can compete with the aerial resolution of lenses, at least on paper. This post is about a plot I created some time back (for a different presentation) showing the growth of pixel count and the shrinking of pixel size over the years for three popular segments of digital cameras. The green line plots the diffraction-limited (aberration-free) optical resolution 3 stops below the maximum aperture available for off-the-shelf lenses during the same period. The optical resolution line doesn’t mean much on its own; it is plotted only to compare the sensor resolution with the optical resolution over time. The graph shows that while the sensor resolution has improved by leaps and bounds, the optical resolution hasn’t. That is no surprise, because the optical resolution is limited by the fundamental nature of light: diffraction. Improving the optical resolution by traditional means is very expensive and results in bulky lenses. The time is just right for exploring computational methods to improve the system resolution of imaging systems.
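As a back-of-the-envelope version of that comparison, the sketch below puts a diffraction-limited lens and a sensor on the same footing by comparing the lens’s incoherent cutoff frequency and Airy spot size with the sensor’s Nyquist frequency. The wavelength, f-number, and pixel pitch are example values assumed here; they are not the data behind the plot.

```python
# Assumed example values; not the data behind the plot below.
wavelength = 550e-9     # mid-visible light [m]
f_number = 5.6          # lens working f-number
pixel_pitch = 1.2e-6    # sensor pixel pitch [m], typical of small phone sensors

# Diffraction-limited (aberration-free, incoherent) cutoff frequency of the lens:
#   f_cutoff = 1 / (wavelength * f_number)   [cycles per metre]
f_cutoff = 1.0 / (wavelength * f_number)

# Rayleigh criterion: radius of the Airy disc, the classical resolution spot size
r_airy = 1.22 * wavelength * f_number

# Sensor Nyquist frequency: one cycle needs at least two pixels
f_nyquist = 1.0 / (2.0 * pixel_pitch)

print(f"Lens cutoff frequency : {f_cutoff * 1e-3:6.1f} cycles/mm")
print(f"Airy spot radius      : {r_airy * 1e6:6.2f} um")
print(f"Sensor Nyquist freq.  : {f_nyquist * 1e-3:6.1f} cycles/mm")
print("Sensor out-resolves the lens" if f_nyquist > f_cutoff
      else "Lens out-resolves the sensor")
```

With a 1.2 µm pixel pitch, the sensor’s Nyquist frequency already exceeds the f/5.6 diffraction-limited cutoff of the lens, which is exactly the “on paper” parity mentioned above.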

Figure (pixelEvolution): growth of pixel count and shrinking of pixel size over the years for three camera segments, with the diffraction-limited optical resolution line for comparison.

Other interesting data points in the graph are:

1. The Kodak DCS 460, based on a Nikon SLR body, was one of the first professional digital SLR cameras.

2. The Sharp J-SH04 was the first cellphone with a built-in camera.

The number of megapixels, and with it the pixel resolution (through shrinking pixel sizes), has increased rapidly for cellphone and point-and-shoot cameras, probably driven by marketing rather than by picture quality. In the more professional segment, the strategy has clearly been different. This may be for two main reasons: one, image-quality factors such as noise, color reproduction, and low-light performance are more important for these photographers; and two, building large, high-quality lenses is relatively more expensive.