AN OMNIFOCUS NONFRONTAL IMAGING CAMERA

 

Narendra Ahuja

Donald Biggar Willet Professor of Engineering

Department of Electrical and Computer Engineering

University of Illinois at Urbana-Champaign

ahuja@vision.ai.uiuc.edu, Tel: (217) 333-1837, Fax: (217) 244-8371

 



 WHAT IS OMNIFOCUS CAMERA?


Omnifocus nonfrontal imaging camera, OMNICAM or NICAM, has introduced hitherto nonexistent imaging capabilities, in addition to overcoming some problems with previous methods. NICAM is capable of acquiring seamless panoramic images and range estimates of wide scenes with all objects in focus, regardless of their locations. To understand the impact of NICAM, first consider imaging large scenes with conventional cameras. The camera's field of view is generally much smaller than the entire visual field of interest. Consequently, the camera must pan across the scene of interest, focus on a part at a time, and acquire an image of each part. All the resulting images together then capture the complete scene. As a byproduct of focusing, range-from-focus estimates can also be derived. Usual methods for focusing as well as range estimation from focusing mechanically relocate the sensor plane to vary the focus distance setting in the camera. When a scene point appears in sharp focus, the corresponding depth and focus distance values satisfy the lens law. The depth for the scene point can then be calculated from the focus distance.


The conventional methods therefore involve two mechanical (and hence slow) actions, those of panning  and, for each chosen pan angle, focusing i.e. finding the best focus distance setting. The purpose of the first action is to accumulate data for the entire visual field from the camera's narrower fields of view. This action is therefore essential. An innovation of nonfrontal imaging is in the elimination of the second action. The nonfrontal imaging camera has a sensor plane which is not perpendicular to the optical axis as is standard. This imaging geometry eliminates the time consuming mechanical translation of the sensor plane. Camera panning, required for panoramic viewing anyway, in addition enables focusing. Further, a range-from-focus estimate for each visible scene point is also computed as a by-product of identifying the sharpest image. Thus, from pan motion alone, nonfrontal imaging obtains a composite focused image of all objects/points in a wide scene regardless of their depths, which is in complete registration with a range map obtained in parallel. While it is well known that focus distance control yields both range and focused images, nonfrontal imaging has made it possible for the first time to realize this dual functionality simultaneously for all visible scene points.


Thus, nonfrontal imaging has the following novel capabilities: (i) It provides panoramic (up to 3600) images of a scene without any visible seams. (ii) Each object in an image is in complete focus regardless of its location, i.e., that there is no need to explicitly perform the standard focusing action (accomplished mechanically in "manual" cameras and automatically in "automatic" cameras, but requiring mechanical movement in each case). (iii) Along with the sharp visual image, the camera also delivers the location (coordinates) of each focusable, visible scene point. One consequence of these capabilities is that a single nonfrontal imaging camera can provide stereo pairs of images for three-dimensional, omnifocused, viewing of the entire scene in natural lighting. In fact, this visual 3D experience is even more informative in some ways than natural, human viewing of real world, since humans have finite depth of field while the NICAM driven display shows all parts in focus.


UNIQUENESS

Nonfrontal imaging represents qualitative leaps in what is feasible with the current technology. It makes it possible to achieve hitherto infeasible functionalities and performance levels in imaging. None of the available techniques can deliver seamless focused panoramic images. Most methods choose a scene point/object and bring it into focus by (manually or automatically) controlling sensor location. Thus, they can focus at objects one by one. Similarly, they can estimate range from focus one object/point at a time. The following paragraphs describe the differences between NICAM and the various related existing technologies.

Panoramic Images: In conventional photography, generating panoramic images has been more of an art, pursued by artists who take independent photographs and create a mosaic from them. There are some fundamental problems with this, however. Whenever an image is taken by the camera, a choice of focus distance must be made. Usually, this is done by imaging in focus that object which is of most interest appears at the image center. This means that any objects at other distances in the camera's field will not be in focus. In particular, the image borders may have different amounts of blur along them. Since in general different focus settings are used to obtain photographs of contiguous scene parts, when they are mosaiced to form a panoramic image the discontinuities in the image, sharpness of the scene parts straddling the borders give rise to seams. Thus in the panoramic image neither all objects are imaged in focus nor is the mosaic seamless. One may attempt to alleviate this problem by reducing the size of the camera's visual field, but this does not eliminate the problem because the objects in the scene are not of the same shape as the camera's visual field (e.g., rectangular) so they always straddle across image borders. Of course, the smaller the visual field size the larger is the number of images in the mosaic, which increases the seam density. In fact, a major issue in the construction of panoramic images has been how to process the mosaic to camouflage seams in order to avoid perceptual detection.


Panoramic Cameras: A number of panoramic cameras have been designed over the years for photographic applications. The scene scan is performed by moving the camera mechanically, or pointing it at a special reflector surface such as a conical mirror. To image certain scene points in sharp focus, either the points have to be at a specified depth from the camera, or the depth of field of the camera must be made sufficiently large, e.g., by a combination of reducing the aperture and increasing the focal length. These solutions are not acceptable since they require that the ambient light intensity levels be high or that the scene objects of interest all lie in a narrow depth range.


Panoramic Range Acquisition: Analogous to panoramic image acquisition, panoramic range acquisition methods are not very common. Typically a narrow field of view device is pointed in different directions to obtain a panoramic range image. Or, multiple devices are arranged in a circle such as in sonar sensor rings. Almost all methods are invasive, i.e., they involve scene illumination, e.g., using a laser beam or structured lighting.


Patented Technologies: Most of the relevant previous patents have been filed by Japanese  corporations. However, none of these come close to the NICAM methodology. The closest idea to NICAM was in a patent filed for the Asahi Kogaku Kogyo Kabushiki Kaisha of Tokyo, Japan. In this patent, a camera was described that could select one of three tilt angles between the CCD plane and the optical axis. The application suggested was in photographing one frame of a scene with two subjects. The range to the subjects was determined by an unspecified ``range-determining means''. EMI Limited, England has a patent on using a tilted plane sensor to determine the focus motor drive signal. Some of the other companies that hold patents in slightly related ideas are Olympus Optical Corporation, Japan, Canon Kabushiki Kaisha and Hitachi Ltd, Tokyo, Japan. A camera with a tilted sensor plane and a scanning mirror was also used by a researcher at Jet Propulsion Laboratory to determine the range of scene points.


SAMPLE IMAGES TO ILLUSTRATE PERFORMANCE


Images 1-5 illustrate the performance of NICAM. These images have been obtained using a rough prototype that we have constructed ourselves. There are parts of the images where quality would improve if a professionally built system is used.

 

For the purpose of displaying the panoramic images, typically an image is divided into multiple rows, each showing the view over smaller than the entire angle covered. If the entire length of the panoramic image were printed in a single row, the height will be reduced significantly. To avoid such excessive compression of detail, and to maximize legibility by using all available space, a complete panoramic view is divided into smaller contiguous subangles, and the corresponding subimages are shown in successive rows of the image. Thus, the right end of the row connects to the left end of the following row.


Images 1, 2 and 3 are examples of omnifocused panoramic images acquired using NICAM. Image 1 shows a 1000 view of an outdoor scene, shown split into two rows of 500 each. The objects in the scene are at a range of distances (flowers 1.5', tree 4.5', bench 8', chair 30' and building 50' and larger) but all are imaged in focus and no seams exist across the entire panoramic view. Image 2 shows a 600 view of a room where the distances range from a few feet to about 20 feet. Finally, Image 3 contains a 3600 panoramic view where object distances of 2 to 30 feet from the camera are indicated.


Image 4 demonstrates the omnifocusing performance of NICAM in comparison with the limited depth of field of a regular camera. The upper row in Image 4 (i) shows a 400 (angle chosen by user) omnifocused image acquired by NICAM, and the lower row shows a 200 (angle dictated by the camera) view of the same scene acquired using a regular camera focused at 4' (a choice must be made as to which object to focus on). The progressive loss of focus for objects located closer or farther than the focused depth of 4' can be seen. Image 4 (ii) shows a "panoramic" view constructed from multiple images taken by a regular camera, by concatenating images of contiguous parts of the scene. Since the different images are taken when the camera is focused at different objects, the borders between images give rise to seams. Further, since the camera is focused on a specific object as each image is acquired, not all parts within even a single image are in focus. This should be contrasted with the seamless, panoramic imaging capability of NICAM shown in Images 1-3.


Image 5 (i) demonstrates the impact of the range estimation capability of NICAM. A pair of contiguos planar patches formed by wrapping newspaper on a step-like structure is placed in front of NICAM. The resulting omnifocused panoramic image is shown in the first row. The second row shows the range estimation capability wherein the step structure recovered by NICAM is depicted. The range estimate available for each pixel in the omnifocused image determines the position and irradiance of the corresponding scene point. The omnifocused image and the recovered shape are combined to produce the 3D omnifocused step structure shown through a perspective view in the bottom row. The intensity and range information are combined to produce pseudostereo images as would be acquired by a pair of cameras in place of a single NICAM. Such stereo images when viewed through a stereo mechanism, e.g. stereo glasses, depict the scene in full 3D, using data obtained by a single NICAM! Image 5 (ii) shows such stereo pairs for three scenes. For ease of viewing, the left and right "eye" images have been color coded red and green and overlapped; when viewed through red and green glasses on different eyes, the scenes can be seen in 3D and omnifocused.  The top left scene consists of two planes perpendicular to the line of sight, the nearer one at a distance of 2 ft. from the camera (right plane) and the farther one at a distance of 3 ft. (left plane). The top right scene contains a single chess piece at distance 1 in. The bottom row shows multiple chess pieces placed at three different distances from the camera: 17 in., 20 in. and 25 in. All parts of all scenes are in focus and the 3D structure is visible through stereo viewing. It appears that such 3D depiction using a single visual camera has never been accomplished before!

 


Image 1: 1000 Outdoor View

 

 

 

Image 2: 600 Indoor View

 

 

 

Image 3: 3600 Indoor View

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Image 4(i): 400 Omnifocused View

 

 

 

 

 

Image 4(i): 200 Standard Camera View of the Same Scene as in 4(i), Focused at 4 ft.

 

 

 

 

 

 

 

 

 

Image 4(ii): Regular Camera Mosaic

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Image 5(i): On the following page:

    Image 5(ii)

 



 APPLICATIONS OF NICAM

 

Since NICAM based systems make fundamental improvements in the quality of the acquired images, these systems have a broad appeal for use in the familiar applications of imaging. Further, since NICAM based systems offer hitherto nonexistent capabilities, they are expected to initiate new imaging applications not previously feasible. Some companies, institutions, and government agencies have indicated their plans to use NICAM to design new products and in applications and services, some of which are quite imaginative and interesting. The following paragraphs describe some examples of novel systems made  possible by NICAM.


Photography: Suppose a photographer wishes to capture a scene around the Washington monument. She must make two decisions before she can push the camera button. First, she must decide which part of the scene she wants to capture in the photograph, and then direct the camera to point in the desired direction using the appropriate zoom lens. Second, within the visible scene, she must determine whether the photograph should show the Washington monument in sharp focus, or the trees in front, or the buildings behind, and set the focus control accordingly. The result will be a picture showing, say the monument, in sharp focus, and the lawn to the left and right as well as the trees and the buildings blurred. If she wants to show a wider scene than the camera's field of view, she must take multiple pictures by moving the camera across the scene and then manually ``pasting'' the individual photographs to make a mosaic showing the large scene. Each picture in the mosaic will show a pre-selected object in sharp focus, with consecutive photographs taken, in general, with different focus settings. Consequently, the degree of blur across a picture boundary would change visibly, causing perceptible sharpness transitions across objects that happen to straddle inter-picture borders. Of course, even within each pasted image the objects other than those focused on appear blurred proportional to their relative distances from the focused object. Further, the entire process is very time consuming because of the required mechanical redirecting of cameras, mechanical focusing in each direction, and the subsequent cutting and pasting. Using the NICAM technology, the photographer could capture the Washington monument, the buildings behind, the trees in front, the lawn and objects as far to the left and right as she wants, up to a complete 3600view, all of which would be completely focused in a single photograph.

 

Security and Surveillance: Another major application area is that of security systems and surveillance. For example, consider multiple cameras located at one or more posts outside a building to achieve visibility in all directions. Typically, a guard inside the building would monitor the images delivered by the cameras on separate monitors placed in front of him. The cameras of course show certain objects in focus while others, outside of the depth of field, appear blurred. The guard could control the cameras to focus at different objects in different directions, but that would only result in switching among the areas monitored best. If NICAM is used, a single camera will replace the entire set of cameras and still obtain a focused image of all parts of the scene visible from the watchpost, with no loss of detail due to blurring. This image could be displayed over a surrounding screen, as well as on the usual separate monitors. Further, the guard could also be shown a 3D stereo display of the 3600 surround, using a single NICAM. His display could be viewed in 3D using a headmounted display. Alternatively, the guard could see it on a surrounding 3600 screen, e.g., using stereo glasses, while being able to turn his head in any direction as if he were on the watchpost itself.

 

Analogously, inside a building, a small number of NICAM’s can cover the entire premises to be monitored, instead of a much larger number of normal cameras. For example, a 3600 view of a large building lobby may be covered by a few NICAM’s instead of many ordinary cameras which may still not cover as much depth as NICAM. When located inside a nuclear plant or another hazardous area, clear views of the entire scene can facilitate much faster teleoperation with no adjustment of the camera.


Surgery: In endoscopic or laproscopic surgery, it is a common problem that the surgeon cannot clearly see the interior body structure in the vicinity of the area under operation which leads to imprecision in surgery. A miniature NICAM could image the large, wide space inside the stomach in focus. The surgeon could then perform the operations using NICAM, either with the aid of only the focused images or while stereoscopically viewing the space in 3D and in focus. Another example is neurosurgery. Neurosurgeons today must constantly adjust the focus of the visual aid as they move their tools even by a few millimeters to nearby parts of the brain. This requires constant shifts of hands and attention. With NICAM, the focus will not have to be adjusted so her/his hands can perform surgery continuously without interruptions to adjust focus.


Manufacturing and Navigation: The range estimation capability of NICAM is also useful for automation. A real time NICAM placed on a mobile autonomous vehicle could detect the presence of obstacles. The vehicle could then avoid collisions with them, e.g., as it delivers material across the factory floor. Analogous advantages follow for other mobile robots (mail delivery, household). The reliability of range estimates will vary with object depth in a priori known ways. NICAM could also be used to visually acquire models of objects to be manufactured or modified at speeds much faster than those of laser based scanners which must scan the object a small part at a time. On highways, NICAM could be used by a vehicle to estimate the distances to the nearest objects in all directions, as well as to track moving objects.


Broadcasting: In television reporting, large groups of people (news conferences, football game spectators) could be shown in sharp contrast everywhere using NICAM. This will be an improvement over the current practice where image transmission is switched across one or more cameras, each focused at some object but showing other visible objects out of focus.


Virtual Reality: The 3D display capability of NICAM makes it an excellent candidate for providing omnifocused stereo displays of scenes. Thus, NICAM can be used in virtual environments, e.g., for video games, teleoperation and augmented reality.

 

ILLUSTRATIONS OF APPLICATIONS


Some of the applications in which NICAM can be used are illustrated through Images 6-10 on the following pages. Image 6 illustrates the application of NICAM in remote monitoring of 3D environment for home security. Image 7 demonstrates omnifocused viewing of brain at high magnifications so the surgeon can avoid adjusting the focus and concentrate his mind and hands on the surgery. In Image 8, the NICAM is used as a sensor for locating nearby obstacles and thus avoiding them, and thus for autonomous control of vehicle motion. Image 9 shows how the 3D shape information extracted by NICAM can be supplied to a computer aided design system, and in turn, to numerically controlled machines which can then automatically make a duplicate of the model object. Finally, Image 10 illustrates how NICAM can be used in remote manipulation of hazardous environments. Here, NICAM is located next to a robotic manipulator in a hazardous environment; the images obtained by NICAM are transmitted to a safely located human operator who sees a virtual 3D view of the scene from the robot’s viewpoint; the user manipulates the virtual environment through a virtual manipulator; and the manipulation actions are then transmitted to and followed by the slave manipulator at the hazardous site.

 

 

 

 

 

 

 

 

 

 

 

 

Image 6

 

 

Image 7

 

 

 

          Image 8

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

                Image 9

 

 

 

 

 

 

 

 

 

 

       

    Image 10

 


 

PROTOTYPE DEVELOPMENT

Several prototypes of the camera have been developed. One current implementation with on-board processing is shown in Image 11. The camera has a touch interface, located on the right hand side, that makes the operation easy (as has been demonstrated by the experience of first-time users). The images can be stored and transported using a built-in zip drive seen at the front bottom of the camera in Image 11. The system also contains a large hard disk and ports for connectivity to the outside. The camera has an onboard display so the user can monitor and control the desired beginning and end pan angles. The system needs only a connection to the power line. A remote unit with the camera end separated from the processing unit (e.g., a desktop or a laptop computer) is under development (Fig. 12).


 The NICAM systems we have implemented so far involve panning, and therefore, they require special processing of moving parts of the scene. We have also developed alternative designs that can capture moving objects in omnifocus, and have large dynamic range to capture the brightest and the darkest objects simultaneously. Our implementations so far have used gray scale sensors but we have also verified color imaging ability.

 

 

 

 

 

Image 11: A Current Prototype

 

 

 

Image 12: A Remote prototype under development, with camera separated from the host computer.