Abstract:
Images are analyzed within a 3D environment that is generated based on spatial relationships of the images and that allows users to experience the images in the 3D environment. Image analysis may include ranking images based on user viewing information, such as the number of users who have viewed an image and how long an image was viewed. Image analysis may further include analyzing the spatial density of images within a 3D environment to determine points of user interest.
Abstract:
The present invention is embodied in a system and method for extracting structure from multiple images of a scene by representing the scene as a group of image layers, including reflection and transparency layers. In general, the present invention performs layer extraction from multiple images containing reflections and transparencies. The present invention includes an optimal approach for recovering layer images and their associated motions from an arbitrary number of composite images. The present invention includes image formation equations, the constrained least squares technique used to recover the component images, a novel method to estimate upper and lower bounds on the solution using min- and max-composites, and a motion refinement method.
Abstract:
An “Image Denoiser” provides a probabilistic process for denoising color images by segmenting an input image into regions, estimating statistics within each region, and then estimating a clean (or denoised) image using a probabilistic model of image formation. In one embodiment, estimated blur between each region is used to reduce artificial sharpening of region boundaries resulting from denoising the input image. In further embodiments, the estimated blur is used for additional purposes, including sharpening edges between one or more regions, and selectively blurring or sharpening one or more specific regions of the image (i.e., “selective focus”) while maintaining the original blurring between the various regions.
Abstract:
A system and process for generating a panoramic video. Essentially, the panoramic video is created by first acquiring multiple videos of the scene being depicted. Preferably, these videos collectively depict a full 360 degree view of the surrounding scene and are captured using a multiple camera rig. The acquisition phase also includes a calibration procedure that provides information about the camera rig used to capture the videos that is used in the next phase for creating the panoramic video. This next phase, which is referred to as the authoring phase, involves mosaicing or stitching individual frames of the videos, which were captured at approximately the same moment in time, to form each frame of the panoramic video. A series of texture maps are then constructed for each frame of the panoramic video. Each texture map coincides with a portion of a prescribed environment model of the scene. The texture map representations of each frame of the panoramic video are encoded so as to facilitate their transfer and viewing. This can include compressing the panoramic video frames. Such a procedure is useful in applications where the panoramic video is to be transferred over a network, such as the Internet.
Abstract:
A system and method for manipulating a set of images of a static scene captured at different exposures (i.e., “bracketed” images) to yield a composite image with improved uniformity in exposure and tone. In general, the aforementioned goal can be achieved by analyzing a set of bracketed images using a multi-dimensional histogram and merging the images via an approach that projects pixels onto a curve that fits the data. However, it has been found that the desired composite image can be produced in a simpler manner by summing the pixel brightness levels across the multiple images, followed by an equalization process. One possible equalization process involves simply averaging the summed pixel brightness values by dividing the summed value of each pixel set (i.e., groups of corresponding pixels from the bracketed images) by the number of bracketed images. An even better result can be achieved using a histogram equalization process. In essence, this histogram equalization involves creating a count of the number of pixels sets having the same summed brightness level. From this count, a cumulative distribution function is computed and normalized to a maximum value corresponding to the maximum summed brightness level. The cumulative distribution function is then used to determine new pixel brightness levels for use in generating the composite image.
Abstract:
A computer-based method and system for digital 3-dimensional imaging of an object which allows for viewing images of the object from arbitrary vantage points. The system, referred to as the Lumigraph system, collects a complete appearance of either a synthetic or real object (or a scene), stores a representation of the appearance, and uses the representation to render images of the object from any vantage point. The appearance of an object is a collection of light rays that emanate from the object in all directions. The system stores the representation of the appearance as a set of coefficients of a 4-dimensional function, referred to as the Lumigraph function. From the Lumigraph function with these coefficients, the Lumigraph system can generate 2-dimensional images of the object from any vantage point. The Lumigraph system generates an image by evaluating the Lumigraph function to identify the intensity values of light rays that would emanate from the object to form the image. The Lumigraph system then combines these intensity values to form the image.
Abstract:
The described implementations relate to deblurring images. One system includes an imaging device configured to capture an image, a linear motion detector and a rotational motion detector. This system also includes a controller configured to receive a signal from the imaging device relating to capture of the image and to responsively cause the linear motion detector and the rotational motion detector to detect motion-related information. Finally, this particular system includes a motion calculator configured to recover camera motion associated with the image based upon the detected motion-related information and to infer imaging device motion induced blur of the image and an image deblurring component configured to reduce imaging device induced blur from the image utilizing the inferred camera motion induced blur.
Abstract:
A local bi-gram model object recognition system and method for constructing a local bi-gram model and using the model to recognize objects in a query image. In a learning phase, the local bi-gram model is constructed that represents objects found in a set of training images. The local bi-gram model is a local spatial model that only models the relationship of neighboring features without any knowledge of their global context. Object recognition is performed by finding a set of matching primitives in the query image. A tree structure of matching primitives is generated and a search is performed to find a tree structure of matching primitives that obeys the local bi-gram model. The local bi-gram model can be found using unsupervised learning. The system and method also can be used to recognize objects unsupervised that are undergoing non-rigid transformations for both object instance recognition and category recognition.
Abstract:
Techniques and systems are disclosed for navigating human scale image data using aligned perspective images. A consecutive sequence of digital images is stacked together by aligning consecutive images laterally with an image offset between edges of consecutive images corresponding to a distance between respective view windows of the consecutive images. A view window of an image in the sequence is rendered, where the view window of the image corresponds to a desired location. Offset portions of the view window of a desired number of images in the sequence are rendered, for example, alongside the full view of the image at the desired location.
Abstract:
A Bayesian two-color image demosaicer and method for processing a digital color image to demosaic the image in such a way as to reduce image artifacts. The method and system are an improvement on and an enhancement to previous demosaicing techniques. A preliminary demosaicing pass is performed on the image to assign each pixel a fully specified RGB triple color value. The final color value of pixel in the processed image is restricted to be a linear combination of two colors. Fully-specified RGB triple color values for each pixel in an image used to find two clusters represented favored two colors. The amount of contribution from these favored two colors on the final color value then is determined. The method and system also can process multiple images to improve the demosaicing results. When using multiple images, sampling can be performed at a finer resolution, known as super resolution.